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No easy answer 


Demands to analyse Connecticut school shooter’s DNA are misguided and could lead to dangerous 


stigmatization, or worse. 


analysis of mass killer Adam Lanza, who shot 20 children, 

6 school staff, his mother and himself in Newtown in Decem- 
ber. At first glance, it is easy to understand why. Confronted with such 
senseless violence, it is human nature to seek solace in scientific expla- 
nations. After John Wayne Gacy was executed in 1994 for the murder 
of 33 young men and boys, his brain was preserved and examined for 
clues to what made him a monster. More than 80 years ago, scientists 
reportedly studied the brain of serial killer Peter Kiirten, the ‘vampire 
of Dusseldorf’ who was executed in 1931. 

This quest to understand endures as technology advances. Now, 
instead of looking at cranial folds and frontal lobes for clues to the 
massacre, geneticists at the University of Connecticut in Farmington 
will scour Lanza’s genes. On its own, this hunt will be about as informa- 
tive as studies of the brains of murderers: not very. 

The Connecticut scientists will not talk about the job they have been 
handed. It is not clear what they will find, or even what they should look 
for. Suspend disbelief for a moment and pretend that a ‘mass-shooter 
gene’ exists — something that no serious geneticist believes — and sci- 
entists could still draw no conclusions froma single individual’s genome. 

To be sure, many links and suggestions of links have been identified 
between genetics, mental illness and, to a lesser extent, violence. A study 
using Swedish registries (R. Kuja-Halkola et al. Dev. Psychopathol. 24, 
739-753; 2012) found that children born to men older than 60 were 
more likely to be convicted of violent crimes than were those born to 
men aged 40-60 years, an observation that might be linked to increasing 
numbers of mutations in sperm as men age. Genetic risk factors have 
been identified for autism, depression and schizoid spectrum disorders, 
but they explain relatively little. People diagnosed with schizoid spec- 
trum disorders are more likely to be convicted of violent crimes than 
are those with no such diagnosis, but the vast majority of people with 
mental illness do not commit crimes. 

Such associations hold only for groups. Many healthy people carry 
mutations associated with disease; many people with mental illness 
carry no known risk factors. Mass shooters are often young white 
men, yet very few young white men become mass shooters. There is 
no one-to-one relationship between genetics and mental health or 
between mental health and violence. Something as simple as a DNA 
sequence cannot explain anything as complex as behaviour. 

But there is a dangerous tendency to oversimplify, especially in the 
wake of tragedy. If Lanza’s DNA reveals genetic variants — as it inevi- 
tably will — people who carry similar variants could be stigmatized, 
even if those variants are associated only with ear shape. If Lanza has 
genetic variants already associated with autism or depression, people 
with those diseases could come under suspicion as well. 

The real risk here, and the real flaw in the Connecticut exercise, is 
that to identify a genetic variant is more straightforward — but arguably 
less informative — than to characterize the complex environment of the 


CComsser state medical examiner has requested a full genetic 


individual. Lanzas DNA will be analysed not because it will be useful 
but because it can be analysed. The ease of DNA sequencing will lead to 
a dangerous temptation to focus on minor, even spurious, genetic cor- 
relations at the expense of non-genetic factors that are more influential. 
Geneticists must explain — and in the wake of the Lanza move many 
already are — that the ability to sequence DNA is many steps removed. 
from the ability to make that sequence meaningful. Many, if not most, 
mutations are meaningless outside narrow contexts. One of the most 
robust examples of a variant that has been 


“This hunt will be linked to antisocial behaviour holds only 
about asinformative for individuals who experience severe 
as studies of the childhood trauma or abuse; those who do 
brains of murderers: not face no greater risk of being antisocial 
not very. i than people without the variant. Less- 


studied risk factors presumably work the 

same way. Genetics matters only in the context of environment. 

Research must go beyond the drive to unpick impulses to violence. It 
should consider the means to violence. On the day that Lanza entered 
Sandy Hook school armed with his mother’s guns, another deranged 
man entered Chenpeng Village school in China's Henan province armed 
with a knife. Horrific as that attack was, no one was killed. Yet research 
on how to reduce gun violence in the United States has been hampered 
by pro-gun lobbyists and politicians, who have reduced the necessary 
funding and record-keeping for preventing injuries from firearms. 

Sequencing Lanza’s DNA may bring benefits, but only if appropriate 
expectations are maintained. At best, such studies will prompt deeper 
interest and funding to figure out the biological and social factors that 
really harm. At worst, they will stigmatize those with mental illnesses 
and distract scientists from more-important questions. = 


Culture shock 


Health-benefit claims for Europe’s foods must 
at last be substantiated by science. 


illa daily probiotic yogurt improve your immune defences? 
W will cooking with olive oil boost the levels of ‘good’ 
cholesterol in your blood? 

So far, the food companies behind these particular claims have not 
supplied the hard scientific evidence to convince expert committees of 
the European Food Safety Authority (EFSA) in Parma, Italy, that they 
are warranted. European Union (EU) legislation means that all simi- 
larly unsubstantiated health claims for food will soon be disallowed. 
A register of permissible claims — which will be regularly updated 
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whenever new scientific evidence can be brought to bear — will be 
presented to the European Parliament for inspection next month, and 
adopted by member states later this year. Already, many health-food 
companies, wary of possible litigation, have toned down their market- 
ing claims. 

The regulations, drafted by the European Commission and adopted 
by the EU in 2007, are intended to put an end to the free ride that the food 
and food-supplement industries have enjoyed until now. Companies 
will no longer be able to market their products with unproven promises 
that consumers will be healthier, slimmer or have happier lives. Even the 
words ‘probiotic and ‘antioxidant will disappear from food and food- 
supplement labels in the absence of confirmed specific health benefits. 

To implement the regulations, the EFSA — whose role is to supply 
scientific advice in support of EU policies — set about drawing up 
a register of permitted health claims for food ingredients, asking its 
expert committees to recommend for inclusion only those for which 
the claims have been unambiguously proven in healthy populations. 

Of the 2,927 consolidated health claims for different ingredients 
examined by the EFSA, only 241 passed muster. A second chance 
was given to 91 of the non-approved claims, 74 of which related to 
microorganisms — pre- and probiotics. But the EFSA rejected all of 
the resubmitted dossiers except two: prunes for normal bowel func- 
tion and the polysaccharide a-cyclodextrin, a soluble dietary fibre, for 
limiting the rise in blood glucose after a meal. 

The high rejection rate dismayed sections of the multibillion-euro 
health-food industry, which has lobbied fiercely against this legislation 
with disingenuous arguments that it inappropriately applies pharma- 
ceutical standards to foods. In fact, it is the industry that has tried to 
make pharmaceutical-level claims for its products while bypassing 
medical-registration procedures and costly quality control. 


But there is an elephant in the room: botanicals. Around 2,000 health 
claims for plant-based foods are on hold at the ESFA as manufactur- 
ers plead for special treatment. The manufacturers refer to the 2001 
EU medicines directive which allowed traditional herbal medicines a 
simplified registration procedure not requiring rigorous proof of effi- 
cacy. The EU health commissioner must decide whether plant-based 
products marketed as health-promoting foods should be treated with 
similar leniency, but the decision has been shamefully delayed. The new 

commissioner, Tonio Borg, has been in office 


“Already, many for only a few weeks, but he needs to bring 
health-food sae quickly —and : firmly a wean to 
companies, — is aoa a scientific evi ais 
wary of p ossible ¢ European ourt 0 ; Justice may yet orce 
eae & a decision on the commission if it continues 
litigation, have ; 2 : 
to prevaricate. The court is currently hearing 
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their marketin three cases of unfair competition from manu- 
ehawe = S  facturers stopped from making claims on non- 


botanical food and food supplements. Yogurt 
manufacturers, for example, do not consider it 
fair that botanicals can continue to be marketed with unproven claims of 
improving immune defences when they are no longer allowed to do so. 

Faced with the new legislation, the yogurt industry has in fact 
buckled down to generate the scientific evidence that the commission 
wants. Manufacturers have joined forces to produce a meta-analysis 
of published evidence on some probiotic strains to try to get at least 
some claims onto the permitted list before it circulates to parliament. 
And they are also launching double-blinded clinical studies to prove 
that certain microbial strains have particular effects on health. That is 
an expensive exercise, but it is the price that must be paid by those who 
want to stay in the game. m 


Realities of risk 


We should focus on dangers that we can control, 
and particularly on those of our own creation. 


hen policy-makers consider global risks, they tend to 
WW casoont from headlines. Troubles in the eurozone could 

spiral out of control, conflict in Syria could spark wider 
unrest, the H5N1 bird-flu virus could mutate and spread from person 
to person in a global pandemic. Those dangers and many others are 
clear and present; society is well-advised to prepare for them, and takes 
good advice on how to do so. The World Economic Forum (WEF), for 
one, publishes an annual risk-assessment report. 

There is, however, another category of risk: the unheralded dangers 
that sneak up on us. Many are the unforeseen consequences of progress, 
of humanity’s scientific and technological quests. For its 2013 report, 
the WEF asked the editors and journalists of Nature to identify five of 
the most disruptive of these risks, dubbed ‘X factors’ 

Neuroscientists, for instance, are avidly pursuing drugs and devices 
that could deliver real cognitive enhancement — not just sharpen- 
ing our alertness and ability to focus, as certain drugs already do, but 
upping our intellectual firepower. From students to business executives, 
the demand for such drugs would be huge — and so would the potential 
for an X factor to strike. Few drugs affect just a single target. The neuro- 
transmitter systems important to cognition also serve other functions, 
raising the spectre of serious side effects: for example, a drug that boosts 
memory might also make the user more prone to impulsive behaviour. 
And then there are the ethical conundrums: should the market decide 
who gets the benefits of these drugs and who does not? Should they be 
banned, to level the playing field — or subsidized, for the same reason? 

Climatologists have more mixed feelings about schemes for 
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geoengineering — deliberately altering the climate system to combat 
the effects of rising greenhouse-gas emissions. In one scenario, high- 
flying jets or balloons would release a haze of sulphate particles into 
the stratosphere, dimming the Sun’s rays and cooling the planet. Fear- 
ful that geoengineering could affect the climate system in unexpected 
ways, researchers have deliberated and studied it, and so far proposed 
only the most cautious of experiments. A second X factor looms here: 
geoengineering is simple and cheap enough that a rogue nation, or 
even a company, could deploy it on a large scale before its risks are well 
understood, perhaps triggering a widespread climatic crisis. 

The other X factors that Nature staff identified are no less dramatic: 
the societal burden of the millions of people who, thanks to progress 
against killer diseases, will join the ranks of the disabled and those with 
dementia; catastrophic climate feedback such as the collapse ofan ice 
sheet; and the possible social consequences of contact with alien life 
(although this is perhaps more X-files than X factor). 

In this week’s issue, Nature also takes up related topics. A World 
View on page 135 offers psychological insight into ‘digital wildfires’ (a 
danger explored in Global Risks 2013). These are a much more certain 
danger than the X factors — in fact, they are a regular feature of the 
Internet, flaring up practically by the minute as rumours or personal 
attacks race through cyberspace. Meanwhile, a Comment on page 157 
examines the shortcomings of efforts to forecast state changes such as 
the collapse of ecosystems or the outbreak of epidemics. 

Finally, a Feature on page 154 offers the liberating perspective that 
some of the very worst things that could happen — catastrophes that 
could devastate the biosphere, and human society with it — are out 
of our hands. These strokes of cosmic bad luck, among them super- 
volcanoes, apocalyptic tsunamis driven by undersea landslides, and 
mega solar flares, have all happened in the geo- 
logical past and are sure to strike again some- 
day. But in many cases there is little we can do 
to prepare or stave them off — and so there is 
no point in worrying. = 
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(although this is perhaps more X-files than X factor). 
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Internet, flaring up practically by the minute as rumours or personal 
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examines the shortcomings of efforts to forecast state changes such as 
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society identified this week by the World Economic Forum in 

its Global Risks report — is well chosen. The image highlights 
the risks of the rapid spread of uncontrollable and destructive informa- 
tion — risks that are made possible by an ever-growing worldwide web 
of communication channels. 

Many digital wildfires start as rumours, and social psychologists have 
long studied how rumours spread. The results of this research can there- 
fore help in the prevention of and response to digital wildfires. 

Why is this important? A digital wildfire ignited by a humorous 
complaint about United Airlines posted on YouTube by musician Dave 
Carroll saw the company’s stock drop by 10%. A false Internet rumour 
that Syrian President Bashar al-Assad had been killed caused oil prices 
to rise. These are not trivial matters. 

Research on rumour has come up with four concepts that may be 
applicable to digital wildfires: motivation, situa- 
tion, narrative context and trust. 

The spreading of rumours is an act of goal-ori- 
ented communication, often motivated by a desire 
to find out whether they are true. Other common 
motives relate to the sender-receiver relationship: 
to please or entertain the receiver and to make the 
receiver like or respect the sender. Self-oriented 
aims include: fulfilment of the sender’s wishes; the 
boosting of self-esteem (or defence against threats 
to one’s sense of self, cherished values or group 
identity); the exacting of revenge; and the achieve- 
ment ofa strategic goal (as with propaganda). 

Situation matters, too. Just as forests are especially vulnerable to fire 
during drought, so too some situations make digital networks suscep- 
tible to wildfires. Ambiguity and threats to assets, both tangible (life, 
health or wealth) and psychological (group honour or cherished values) 
drive fact-finding rumours. Situations of conflict or rivalry produce 
strong defensive sentiments that welcome negative rumours about a 
rival group and positive rumours about our own. Perceived injustice 
often prompts people to take revenge. 

Rumours reflect and resonate with larger cultural, historical and 
ideological narrative contexts, which render the claims more plausible 
and understandable to some groups than others. A YouTube trailer that 
defamed the Prophet Muhammad as a murderer, womanizer, bisexual 
and paedophile resonated with the widespread Islamic narrative of 
Westerners as desecrating, condescending and shameless barbarians 
who derogate the sacred and threaten moral order; violent reactions to 
it throughout the Muslim world cannot be understood without taking 
account of this context. 


r Ahe metaphor ‘digital wildfire’ — one of numerous threats to 
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Rumour research can 
douse digital wildfires 


Work on how rumours arise and spread could help to dampen the effects of 
damaging misinformation circulating on the Internet, says Nicholas DiFonzo. 


refutation or response. Research has shown that, under conditions of 
interdependence and equal status, increased contact with members of 
rival groups decreases negative stereotyping and increases trust. 

But in the digital world, rival groups tend to have little contact with 
each other. Recruiting members from opposing factions to projects 
with shared goals may therefore help to ease these tensions. In my own 
laboratory, when groups of strong Republicans and Democrats commu- 
nicated online to accomplish a task, fewer believed polarizing rumours 
such as “Democrats abuse drugs” and “Republicans are racist”. 

These strands of rumour psychology can help us to understand digital 
wildfires. Why did Carroll’s YouTube complaint that United Airlines 
broke his US$3,500 guitar, then refused to pay for repairs go viral? First, 
Carroll is a professional singer and his video was clever, humorous and 
entertaining; passing it along to friends and co-workers enhanced 
sender-receiver friendships. 

Second, he tapped into a wellspring of distrust: 
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CLIMATE SCIENCE 


West Antarctic 
warming hotspot 


West Antarctica warmed by 
about 2.4°C between 1958 
and 2010, making it one of the 
fastest-warming areas of the 
planet. 

Previously reported 
temperature trends in 
West Antarctica have been 
disputed, and records in 
the region are sparse. David 
Bromwich at Ohio State 
University in Columbus 
and his colleagues analysed 
the most complete set of 
records available, from the 
Byrd Station US research 
outpost, and filled in missing 
observations with modelling 
and data from other sites. 
Their analysis shows warming 
in West Antarctica in the 
spring, summer and winter. 

Despite the lack of evidence 
for increased surface melting 
in this region so far, a 
continued rise in summer 
temperatures could boost the 
chances of more frequent and 
intense melting events, the 
authors say. 
Nature Geosci. http://dx.doi. 
org/10.1038/ngeo1671 (2012) 


| GENOMICS 
Single-cell 
sequencing 


Sequencing a genome 
usually requires DNA from 
thousands or even millions 
of cells, but a technique now 
allows more than 90% of the 
genome ofa single cell to be 
sequenced. This could enable 
studies, for example, on how 
mutated cancer cells emerge 
or how individual neurons 
differ. 

Sunney Xie at Harvard 
University in Cambridge, 
Massachusetts, and his 
colleagues developed the 
method, which uses short 


Selections from the 
scientific literature 


ANIMAL BEHAVIOUR 


Bisexuality boosts attractiveness 


Female fish find some males more attractive if 
they have seen the males engaging in mating 
behaviour, even when such behaviour was with 


other males. 


David Bierbach and his colleagues at the 
University of Frankfurt in Germany assessed 
the preferences of female Atlantic mollies 
(Poecilia mexicana) for various males of the 
same species (pictured) by showing them 
video animations of the males and measuring 
the time they spent in proximity to the 
images. Females preferred colourful males at 
first, but showed an increased preference for 
drabber males that they witnessed engaging 


DNA molecules called 
primers. These molecules are 
added to DNA that has been 
isolated from a single cell, 

and that stick to its strands 

to act as starting points for 
DNA replication. The primers 
are designed to reduce the 
excessive copying of some 
portions of the genome at 

the expense of others — a 
problem that has plagued 
other attempts at single-cell 
sequencing. 

Science 338, 1622-1626 (2012) 
For a longer story on this research, 
see http://go.nature.com/fdvwb2 
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in either homosexual or heterosexual 
mating behaviour. Another experiment 
confirmed that the females could tell the 


difference between male and female fish in the 


animations. 


Most powerful 
quasar yet 


Astronomers have discovered 
a quasar that is expelling 
material 10 times more 
powerfully than any previously 
observed. 

The quasar — the 
luminous centre of a galaxy 
powered by a supermassive 
black hole — each year blasts 
out material equivalent 
to the mass of 400 Suns. 
Benoit Borguet at Virginia 


The authors suggest that this increased 
female preference for sexually active males 
may explain the prevalence of bisexuality 
in some group-living species. Homosexual 
behaviour, they say, confers a reproductive 
advantage on males of those species by 
increasing their likelihood of future 
heterosexual mating. 

Biol. Lett. 9,20121038 (2013) 


Polytechnic Institute 

and State University 

in Blacksburg and his 
colleagues spotted the object, 
dubbed SDSS J1106+1939, 
in April 2011 using the Very 
Large Telescope, a European 
Southern Observatory 
facility on Cerro Paranal in 
Chile. The quasar’s existence 
helps to strengthen models 
of galactic evolution, which 
use outflows of supermassive 
black holes as a feedback 
mechanism to constrain star 
formation. 

Astrophys J. 762, 49 (2013) 
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in either homosexual or heterosexual 
mating behaviour. Another experiment 
confirmed that the females could tell the 


difference between male and female fish in the 


animations. 


Most powerful 
quasar yet 


Astronomers have discovered 
a quasar that is expelling 
material 10 times more 
powerfully than any previously 
observed. 

The quasar — the 
luminous centre of a galaxy 
powered by a supermassive 
black hole — each year blasts 
out material equivalent 
to the mass of 400 Suns. 
Benoit Borguet at Virginia 


The authors suggest that this increased 
female preference for sexually active males 
may explain the prevalence of bisexuality 
in some group-living species. Homosexual 
behaviour, they say, confers a reproductive 
advantage on males of those species by 
increasing their likelihood of future 
heterosexual mating. 
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Polytechnic Institute 

and State University 

in Blacksburg and his 
colleagues spotted the object, 
dubbed SDSS J1106+1939, 
in April 2011 using the Very 
Large Telescope, a European 
Southern Observatory 
facility on Cerro Paranal in 
Chile. The quasar’s existence 
helps to strengthen models 
of galactic evolution, which 
use outflows of supermassive 
black holes as a feedback 
mechanism to constrain star 
formation. 
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D. BIERBACH 


THE LANCET/ELSEVIER 


A galaxy far, 
far away 


Using the Hubble Space 
Telescope, astronomers have 
detected light from the most 
distant object yet found — a 
fledgling galaxy that existed 
when the Universe was just 
over 420 million years old. 
The discovery shows that stars 
were organizing into galaxies 
when the Universe was just 3% 
of its current age of 13.7 billion 
years. Dan Coe of the Space 
Telescope Science Institute in 
Baltimore, Maryland, and his 
colleagues spotted light from 
the galaxy by taking advantage 
of an intervening lens of dark 
matter that magnified the 
object. Coe says that this and 
other discoveries suggest that 
light from galaxies might be 
responsible for ionizing the 
early Universe. 

Astrophys J. 762, 32 (2013) 


NEUROPROSTHETICS 


Fine mind control 
of machine 


A woman paralysed in all four 
limbs has shown that she can 
move a prosthetic arm with 
unprecedented precision using 
only her brain to control the 
arm. 

Andrew Schwartz at the 
University of Pittsburgh 
in Pennsylvania and his 
colleagues implanted two 
microelectrodes into the 
womans motor cortex, the 
part of the brain responsible 
for initiating movement. The 
devices recorded cortical 
activity associated with her 


desire to move the robotic 
arm. An external decoder 
connected to the electrodes 
and the arm converted 
these signals into electronic 
instructions for the limb. 

The woman began training 
to use the arm (pictured) 
— which has wrist-joint 
movement comparable to that 
of a human hand — less than 
two weeks after implantation. 
Within two days she could 
make simple movements, 
and within 13 weeks she 
could grasp and move objects 
almost as fast, smoothly and 
accurately as an able-bodied 
person. 
Lancet http://dx.doi. 
org/10.1016/S0140- 
6736(12)61816-9 (2012) 


Precise genomic 
editing 


A set of genetic tools based 
on components of a bacterial 
immune system can make the 
process of adding, deleting 
or replacing specific genes in 
model organisms and human 
cells easier and more precise. 
Current gene-editing 
enzymes are expensive and 
difficult to use. Feng Zhang 
at the Broad Institute of MIT 
and Harvard in Cambridge, 
Massachusetts, and his 
colleagues developed an 
alternative method focusing 
on an enzyme, called Cas9, 
and two RNA molecules from 
a bacterial immune system 
known as CRISPR. These 
molecules work together to 
recognize and cut viral DNA. 
The team modified these 
molecules so that they could 
be used to edit human and 
mouse genomes at precise and 
even multiple locations. 
George Church at Harvard 
Medical School in Boston, 
Massachusetts, and his 
colleagues showed that 
their CRISPR-based gene- 
editing system could be used 
in human cells, including 
reprogrammed stem cells. 
Science http://dx.doi. 
org/10.1126/science.1231143; 
http://dx.doi.org/10.1126/ 
science.1232033 (2013) 
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Direct images of DNA 


> HIGHLY READ 


Nearly 60 years after the structure of 
DNA was first discovered, scientists using 


December transmission electron microscopy have 
captured direct images of it. 

Enzo di Fabrizio at the Italian Institute of Technology in 
Genoa and his colleagues spread droplets of fluid containing 
DNA from a bacterium-infecting virus onto silicon wafers 
that were etched with micrometre-scale pillars and holes. 
As the droplets dried overnight, the DNA stretched taut 
across the tops of the pillars, passing over some of the holes. 
The researchers then imaged the DNA (pictured; scale bar, 
20 nanometres) by beaming electrons through the holes. 

The team observed a structure with helices that repeated 
roughly every 2.7 nanometres (inset, red arrows), consistent 
with A-DNA — anon-physiological DNA conformation 
known to occur in dehydrated samples. 


Nano Lett, 12, 6453-6458 (2012) 


STEM CELLS 


Immune cells 
rejuvenated 


Two groups in Japan have 
reprogrammed cells that fight 
HIV and cancer, boosting the 
number and lifespan of the 
cells. 

The body’s immune 
system makes T cells that 
target viruses and cancer, 
but the pool of these cells 
quickly becomes exhausted. 
Shin Kaneko and Hiromitsu 
Nakauchi at the University of 
Tokyo and their colleagues 
took a group of HIV-specific 
T cells from a person with 
HIV. They reprogrammed the 
cells into stem cells, and then 
added key growth factors so 
that the stem cells developed 
into the original type of 
T cell. The rejuvenated cells 
retained their ability to 


10 JANUARY 2013 | VOL 493 | NATURE | 137 


© 2013 Macmillan Publishers Limited. All rights reserved 


target HIV, had an enhanced 
capacity for proliferation 
and had longer telomeres, 

or caps on the end of their 
chromosomes, suggesting a 
longer lifespan. 

Hiroshi Kawamoto at the 
RIKEN Research Center for 
Allergy and Immunology 
in Yokohama and his 
colleagues used a similar 
approach to reprogram 
T cells specific to melanoma. 
The reprogrammed T cells 
could still recognize a protein 
commonly found on the 
surface of melanoma cells, 
and were present in greater 
numbers than they had been 
in the original population. 
Cell Stem Cell 12, 114-126; 
31-36 (2013) 
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mouse genomes at precise and 
even multiple locations. 
George Church at Harvard 
Medical School in Boston, 
Massachusetts, and his 
colleagues showed that 
their CRISPR-based gene- 
editing system could be used 
in human cells, including 
reprogrammed stem cells. 
Science http://dx.doi. 
org/10.1126/science.1231143; 
http://dx.doi.org/10.1126/ 
science.1232033 (2013) 
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Direct images of DNA 


> HIGHLY READ 


Nearly 60 years after the structure of 
DNA was first discovered, scientists using 


December transmission electron microscopy have 
captured direct images of it. 

Enzo di Fabrizio at the Italian Institute of Technology in 
Genoa and his colleagues spread droplets of fluid containing 
DNA from a bacterium-infecting virus onto silicon wafers 
that were etched with micrometre-scale pillars and holes. 
As the droplets dried overnight, the DNA stretched taut 
across the tops of the pillars, passing over some of the holes. 
The researchers then imaged the DNA (pictured; scale bar, 
20 nanometres) by beaming electrons through the holes. 

The team observed a structure with helices that repeated 
roughly every 2.7 nanometres (inset, red arrows), consistent 
with A-DNA — anon-physiological DNA conformation 
known to occur in dehydrated samples. 


Nano Lett, 12, 6453-6458 (2012) 


STEM CELLS 


Immune cells 
rejuvenated 


Two groups in Japan have 
reprogrammed cells that fight 
HIV and cancer, boosting the 
number and lifespan of the 
cells. 

The body’s immune 
system makes T cells that 
target viruses and cancer, 
but the pool of these cells 
quickly becomes exhausted. 
Shin Kaneko and Hiromitsu 
Nakauchi at the University of 
Tokyo and their colleagues 
took a group of HIV-specific 
T cells from a person with 
HIV. They reprogrammed the 
cells into stem cells, and then 
added key growth factors so 
that the stem cells developed 
into the original type of 
T cell. The rejuvenated cells 
retained their ability to 
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target HIV, had an enhanced 
capacity for proliferation 
and had longer telomeres, 

or caps on the end of their 
chromosomes, suggesting a 
longer lifespan. 

Hiroshi Kawamoto at the 
RIKEN Research Center for 
Allergy and Immunology 
in Yokohama and his 
colleagues used a similar 
approach to reprogram 
T cells specific to melanoma. 
The reprogrammed T cells 
could still recognize a protein 
commonly found on the 
surface of melanoma cells, 
and were present in greater 
numbers than they had been 
in the original population. 
Cell Stem Cell 12, 114-126; 
31-36 (2013) 


© NATURE.COM 

For the latest research published by 
Nature visit: 
www.nature,com/latestresearch 
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POLICY 


Stem-cell finale 
The US Supreme Court has 
guaranteed that government- 
funded researchers will 
continue to be able to work 
with human embryonic stem 
cells. In declining to take up an 
appeal by the plaintiffs in the 
case of Sherley v. Sebelius, 

the high court endeda 
3.5-year legal battle in which 
two researchers working with 
adult stem cells challenged 
the legality of the National 
Institutes of Health (NIH) 
funding the work, which 
requires the destruction 

of days-old embryos. NIH 
director Francis Collins said 
he was “very pleased” with 
the 7 January decision. See 
go.nature.com/ouwpo2 

for more. 


The Polish government on 

2 January imposed a ban on the 
cultivation of two genetically 
modified (GM) crops: 

MON 810, an insect-resistant 
maize (corn); and Amflora, 

a starchy potato used in the 
paper industry. They are the 
only GM crops approved by 
European Union (EU) science 
advisory committees as safe 
for agriculture. Poland is one 
of nine EU members that have 


17 bn 


Number of stars in the 
Milky Way that harbour a 
roughly Earth-sized planet 
in aclose orbit, according to 
researchers at the Harvard- 
Smithsonian Center for 
Astrophysics. They presented 
their study on 7 January at 
the American Astronomical 
Society meeting in Long 
Beach, California. 


Orion bullets shot in detail 


The Gemini South telescope in Chile has been fitted with five 
lasers and three deformable mirrors, allowing astronomers 

to correct for atmospheric distortions over an exceptionally 
large field of view. On 9 January, at a meeting of the American 
Astronomical Society in Long Beach, California, astronomers 
unveiled the telescope’s ultra-sharp portrait of the ‘bullets’ of 
gas seen in the Orion Nebula. In the image, clumps of iron gas 
(blue) race through the nebula, leaving behind pillars of hot, 


glowing hydrogen gas (orange). 


sought to impose such bans 
— despite EU legislation that 
requires all member states to 
permit cultivation of approved 
crops. European courts have 
ruled against bans in two 
countries. Many of the bans 
rely ona legal loophole, with 
countries claiming that there 
is new information on harm 
caused by the crops that is yet 
to be evaluated. 


Energy tax credits 


The US alternative-energy 
industry received a bundle 
of tax incentives in the 
budget deal that was passed 
by Congress on 1 January 
to avert sweeping cuts in 
government spending. The 
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deal offered an extra year 

for electricity producers 

to claim 10 years of tax 
credits for wind, geothermal 
and biomass projects. 
Lawmakers also reinstated a 
tax credit of US$1 per gallon 
(26.5 cents per litre) for 
biodiesel production, applied 
retroactively to 2012 when the 
credit expired, and extending 
through 2013. 


Indian aspirations 
India is setting its sights on 
becoming one of the top five 
global scientific powers by 
2020, and should double its 
research expenditure to 2% 
of its gross domestic product 
in five years, according to 


prime minister Manmohan 
Singh. He was speaking as 

a new policy on science, 
technology and innovation 
was unveiled at the opening of 
the Indian Science Congress 
in Kolkata on 3 January. The 
policy is aimed at increasing 
the number of scientists in 
India by 66% by 2017, and 
at enhancing private-sector 
participation in research. 
Some of its aims, including 
spending targets, were 
proposed in India’s Twelfth 
Five Year Plan, released 

on 27 December 2012. 

See go.nature.com/8te55t 
for more. 


US food safety 

On 4 January, the US Food and 
Drug Administration released 
the first of its long-awaited 
proposals for protecting food 
safety. The draft regulations 
are a result of the Food Safety 
Modernization Act, signed 
into law in January 2011, 

and are intended to shift the 
agency to a preventive, rather 
than reactive, stance on food- 
borne illness. One of the two 
proposed regulations requires 
food makers to develop plans 
for preventing contamination; 
the other sets safety standards 
for farms that grow produce, 
including, for example, 
permitted levels of microbes in 
irrigation water. 


Whistleblower out 


A scientist who set up a 
whistle-blowing website to flag 
up problems in the scientific 
literature revealed his identity 
last week, after he shut down 
the site following legal threats. 
Paul Brookes, a biologist at the 
University of Rochester, New 
York, said he had registered 
www.science-fraud.org in July 
2012, motivated by frustration 
at the channels available 

for dealing with scientific 
misconduct. 
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Success rates stuck 


The success rate for grant 
applicants to the US National 
Institutes of Health (NIH) 
remained unchanged at a 
historic low of 18% in 2012 (see 
graph), the agency announced 
on 2 January. Sally Rockey, 

the NIH’s deputy director for 
extramural research, wrote 

on her Rock Talk blog that 

the rate for research project 
grants was the same as that in 
the 2011 fiscal year, despite a 
3.5% increase in applications 
to 51,313, a record high. 
Average grant size increased 
by 1.1%, to US$454,588. In all, 
the agency spent $15.9 billion 
of its $30.9-billion budget on 
research grants. 


Boston pathogen lab 
The US National Institutes 

of Health (NIH) will support 

a plan for a laboratory in 
Boston, Massachusetts, 

to work with some of the 
world’s most dangerous 


TREND WATCH 


Around 9,500 people lost their 

lives last year in natural disasters 
— less than one-tenth of the ten- 
year average. Had it not been for 
Hurricane Sandy, material losses 


from storms, floods, droughts 


and earthquakes would also have 
been exceptionally low. Overall 
damage from natural disasters 


in 2012 was US$160 billion, 
less than half the 2011 figure 


and slightly below the ten-year 
average, according to the German 
reinsurance group Munich Re. 


Sandy alone accounted for an 
estimated $50 billion. 


pathogens. On 2 January, 

the NIH announced its 

final determination that 
Boston University’s National 
Emerging Infectious Diseases 
Laboratory poses little risk to 
the surrounding community. 
The laboratory, which 

had been under review at 
NIH since 2001, must win 
approvals from state and 
local authorities before it 

can upgrade its research to 
biosafety level 4, the highest 
level of containment. 


Pharma split 

Abbott Laboratories has 

spun off its drug-discovery 
business into an independent 
company called AbbVie, it 
said on 2 January. Abbott, 
based in Abbott Park, Illinois, 
will retain its medical devices, 
diagnostics, nutritional 
products and generic 
pharmaceutical business. 
AbbVie, headquartered in 
North Chicago, Illinois, 


COUNTING CATASTROPHES 


previously functioned as 

an Abbott subsidiary; it 
earned US$17.4 billion in 
2011. The now-separate 

firm plans to expand the 
market for its blockbuster 
anti-inflammatory antibody 
Humira (adalimumab), and to 
push forward candidate drugs 
that include several hepatitis C 
antivirals. 


Stem-cell transfer 


Pioneering biotechnology 
company Geron is shedding 
its assets in human embryonic 
stem cells. The company, 
based in Menlo Park, 
California, announced its 
plans on 7 January. It will 
transfer cell lines, its early 
clinical programme in spinal- 
cord injury, and some 400 
patents and patent applications 
to a subsidiary of BioTime, 
acompany in Alameda, 
California, which is run by two 
former Geron heads. Geron 
funded initial academic work 
to isolate haman embryonic 
stem cells in the 1990s, but 
current management said 

that the technology is a poor 
investment. 


Fetal DNA sale 


A firm that develops non- 
invasive tests for fetal DNA 
was sold to DNA-sequencing 
leader Illumina of San Diego, 
California, for US$350 million, 
the two companies announced 
on 7 January. Verinata Health, 
based in Redwood City, 


Deaths from natural disasters were relatively low in 2012, 
as few catastrophes occurred in developing countries. 


Number of catastrophes Hl 
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15-17 JANUARY 

The nexus between 
water and energy comes 
under tighter focus at 
Abu Dhabi’s annual 
World Future Energy 
Summit, which this year 
is held jointly with the 
inaugural International 
Water Summit. 
go.nature.com/1b68r1 


16-18 JANUARY 
Experts on gravitational 
waves discuss frontiers 
in the field at a meeting 
in Seoul. 
go.nature.com/jugqyh 


California, markets a test for 
chromosomal abnormalities, 
such as Down's syndrome, that 
involves sequencing fetal DNA 
circulating in a mother’s blood. 
The company is embroiled in 
intellectual-property disputes 
with two other firms in a 
prenatal genetic-testing market 
potentially worth billions (see 
Nature 486, 454; 2012). On the 
same day, Iumina’s share price 
dropped by 7% after drug giant 
Roche said that it did not plan 
to buy the company. 


Spill settlement 


Drilling company Transocean, 
based in Vernier, Switzerland, 
reached a US$1.4-billion 
settlement with the US 
Department of Justice on 

3 January for charges related to 
the 2010 Deepwater Horizon 
disaster, in which one of the 
company’s oil platforms in 

the Gulf of Mexico exploded. 
The deal follows a $4.5-billion 
settlement last year by 
oil-and-gas giant BP (see 
Nature 491, 501; 2012). It 
includes $150 million each 

for the National Academy 

of Sciences and the National 
Fish and Wildlife Foundation, 
to support environmental 
restoration and protection and 
health research in the region. 


> NATURE.COM 
For daily news updates see: 
www.nature.com/news 


10 JANUARY 2013 | VOL 493 | NATURE | 139 
© 2013 Macmillan Publishers Limited. All rights reserved 


JOHN GUILLEMIN/BLOOMBERG VIA GETTY 


NEWSIN FOCUS 


MATERIALS Technology from 
old TVs offers way to light up 


nanostructures p.143 costs p.144 


ENVIRONMENT Mercury treaty 
struggles with clean-up 


BIOMEDICINE Prospects 


brighten for induced 
stem-cell therapies p.145 


NEUROSCIENCE How ‘number 
blindness’ hampers 
learning p.150 


The Belchatow power station in Poland is Europe’s largest coal-burning plant, but plans to capture carbon dioxide from it are in limbo. 


Europe’s untamed carbon 


Funding and politics hobble CCS technology, seen as the best hope for cleaning up coal. 


BY RICHARD VAN NOORDEN 


ith its carbon-trading market and 
tough emissions targets, Europe 
plays the part of responsible adult 


at climate-policy negotiations. But in a grow- 
ing blemish on its low-carbon image, the 
region has fallen behind North America in 
the slow crawl to demonstrate systems for 
capturing greenhouse-gas emissions from 
power plants and industry — even as it 
increases its use of coal. 


Announcements made just before Christ- 
mas underlined Europe's troubles in launch- 
ing large carbon capture and storage (CCS) 
projects. A European Commission fund set 
up two years ago in part to support CCS could 
not find a single scheme to finance, and instead 
gave €1.2 billion (US$1.6 billion) to renewable- 

energy projects. And in 


> NATURE.COM a mortifying side note, 
Toreadmoreabout —_ the International Energy 
CCS, see: Agency (IEA) noted that 
go.nature.com/enhtkr as the switch from coal 


to shale gas in the United States lowers the 
price of coal, Europe — where gas is expen- 
sive — is burning more of this dirtier fuel. “It is 
hugely embarrassing for Europe’ international 
standing on climate,” says Vivian Scott, who 
studies the role of CCS in climate policy at the 
University of Edinburgh, UK. 

Admittedly, no country is finding CCS easy 
to fund. The technology to sieve carbon diox- 
ide from exhaust gases has been demonstrated 
ona small scale, and four large projects have 
successfully stored the gas underground. > 
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> But not one large carbon-capture system 
is operating at a power plant anywhere in the 
world. Fitting a large power plant with CCS 
would increase the price of its electricity by 
50-100%, requiring hundreds of millions of 
dollars in subsidies. 

Four years ago, the IEA set out a road map 
that called for building 100 CCS projects by 
2020, together capturing around 150 million 
tonnes of carbon dioxide annually. Monitor- 
ing progress today, Scott says that the world 
will be lucky to see 20 completed in that time 
(see ‘Carbon capture sputtering’). “The reality 
is that incentives in place for renewable energy 
are not in place for CCS. Government policies 
are not ambitious enough to drive large CCS 
projects,” says Juho Lipponen, head of CCS at 
the IEA. Governments have pledged an esti- 
mated $20 billion for CCS demonstrations, 
but more like $100 billion would be needed 
to meet the IEA’ original road map, he adds. 


EUROPEAN DELAYS 

Yet the CCS industry in Europe has been hit 
particularly hard, by a combination of eco- 
nomic recession, political reluctance and 
Brussels bureaucracy. Europe has favoured 
wind and solar projects, with both direct grants 
and ‘feed-in tariffs’ that guarantee high prices 
for low-carbon electricity. Per unit of elec- 
tricity, these subsidies have been as generous 
as those needed for CCS — but because the 
projects are smaller, individual payments are 
less daunting. CCS has had some direct grants 
but no feed-in tariffs; instead, plant opera- 
tors were expected to 


benefit from Europes “Time will tell 
emissions-trading whether this 
scheme, under which technology hasa 
polluting industries market. Fornow 
trade allowances to it’s important 
emit carbon dioxide. tosee these 
Utility companies plants up and 


that built CCS pro- 
jects would be able to 
sell excess carbon credits, and the sale of 300 
million credits would also finance a European 
CCS funding scheme (known as NER300). 
But the price of credits collapsed to €6-7 per 
tonne last year, as the recession cut industrial 
emissions. Last year, a sale of 200 million credits 
raised only €1.5 billion, one-third of what plan- 
ners had hoped for. “The collapse of the emis- 
sions-trading-scheme price is the biggest culprit 


running.” 


CARBON CAPTURE SPUTTERING 


A 2009 road map from the International Energy 
Agency (IEA) foresaw carbon capture and storage 
(CCS) projects progressing at a much faster pace 
than is supported by current reality. 


HB IEA road map 
1) Projected CCS 
progress* 


Total projects 


2010 2015 2020 


*Projections from Vivian Scott, Scottish Carbon Capture and Storage 


for the lack of progress,’ says Howard Herzog, 
a sequestration researcher at the Massachusetts 
Institute of Technology in Cambridge. 

Even so, the commission’s fund had short- 
listed ten CCS projects by July 2012, waiting 
for guarantees of co-funding from member 
states. But five of the six countries in which 
the plants were to be built — the United 
Kingdom, the Netherlands, Poland, Italy and 
Romania — “didr't step up to the plate,” says 
Chris Davies, a British member of the Euro- 
pean parliament who advocated for the CCS 
fund. Meanwhile, a plan to capture carbon 
emissions from a French steel plant became a 
casualty when owners ArcelorMittal, based in 
Luxembourg, pulled out of the scheme. 

The missed opportunity has delayed CCS in 
Europe by two years, says Philippe Paelinck, 
who oversees CCS development at French 
engineering firm Alstom, based in Levallois- 
Perret outside Paris. Alstom runs carbon- 
capture technology at a number of pilot 
projects and had hoped to scale it up at three 
NER300 candidate projects. Nevertheless, the 
NER300 fund will award a second phase of 
grants, potentially this year; Paelinck thinks 
CCS projects in the United Kingdom, Roma- 
nia and the Netherlands are well placed to gain 
some funding. Separate funding schemes, such 
as a £1-billion (US$1.6-billion) CCS compe- 
tition in the United Kingdom — which is 


revamping its electricity market to allow tariff- 
like support for CCS — could also launch pro- 
jects. “There's still activity for CCS in Europe: 
it’s just proving to be a much harder sell than 
initially thought,’ says Lipponen. 


AHELPING HAND FROM OIL 

In the United States and Canada, CCS project 
developers have made faster headway by com- 
bining public subsidies with sales of the carbon 
dioxide they capture. Oil companies inject the 
gas — which sells for $20-40 per tonne — 
into the ground to help to loosen trapped oil. 
Two large North American projects are due 
to start capturing carbon in 2014: Canada’s 
Can$1.24-billion (US$1.26-billion) Boundary 
Dam project in Saskatchewan, a 100-megawatt 
project that would rebuild one unit ofan exist- 
ing coal-fired plant; and the US$2.4-billion 
Texas Clean Energy Project near Odessa, a new 
400-megawatt power plant that would gasify 
its coal before burning. A similar $2.4-billion 
plant in Kemper County, Mississippi, is also 
scheduled to start operation in 2014 but is cur- 
rently facing a legal dispute over whether it can 
raise local electricity rates. 

Funding CCS by striking deals with oil 
producers is not an option in oil-poor parts 
of Europe, however. And Stuart Haszeldine, 
a geologist at the University of Edinburgh, 
cautions that although selling carbon dioxide 
for oil extraction may help to get the industry 
going, it comes at the cost of the carbon emit- 
ted from the extra oil: he claims (in unpub- 
lished calculations) that once oil production 
is factored in, CCS funded in this way could 
actually cut emissions by only 10%. 

Still, CCS advocates say that large-scale 
demonstrations of the technology are crucial 
because CCS offers the only way to clean up 
emissions from coal, which the IEA predicts 
will overtake oil to become the world’s lead- 
ing energy source by 2017. “Time will tell 
whether this technology has a market. For 
now it’s important to see these plants up and 
running,” says Julio Friedmann, head of the 
carbon-management programme at Lawrence 
Livermore National Laboratory in California. 
No country with long-term climate obliga- 
tions could seriously meet them without CCS, 
he says. “Yet Europe is shutting down nuclear 
plants, building coal plants, and not keenly 
pursuing CCS. So the question is, is it serious 
about cutting emissions?” m 
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Electron beams set 


IN FOCUS | NEWS 


nanostructures aglow 


Physicists borrow an old tool from geology to probe the interior layers of metamaterials. 


BY EUGENIE SAMUEL REICH 


ut a piece of quartz under an electron 
Pp microscope and it will shine an icy blue. 

First noticed in the 1960s, the phenom- 
enon, called cathodoluminescence, gave geolo- 
gists an easy way to identify quartz and other 
minerals in rock samples. But the light — emit- 
ted after a beam of electrons kicks a material's 
own electrons into a higher-energy state — is 
faint and diffuse, and that discouraged other 
scientists from harnessing it for fine-scale 
imaging. 

Nowa Dutch group has found a way to col- 
lect and focus a particularly faint and local- 
ized type of cathodoluminescence that had 
been previously ignored, turning the glow 
into a precise probe of a material’s nanoscale 
structure. The researchers expect the technol- 
ogy to reach the market early this year, giving 
materials scientists a new tool for investigating 
the behaviour of light in the interiors of the 
complex nanostructures used in lasers, light- 
based circuits and solar cells. 

“We have built a cathodoluminescent sys- 
tem that is unique in the world,” says Albert 
Polman, a physicist at the FOM Institute for 
Atomic and Molecular Physics (AMOLF) in 
Amsterdam. 

The technique combines the advantages 
of optical and electron-based imaging. An 
electron beam can in principle achieve a 
resolution of less 


than one nanometre, “This has opened 
compared withhun- the door to 

dreds ofnanometres understanding 
for a beam of light. how light 

But maps made by couples to 
scattered or reflected matterina more 
electrons are not fundamental 
typically sensitive to way. ” 


the way light behaves 

in the sample. Cathodoluminescence, by 
contrast, can map the interaction of light 
and matter — but, because it is triggered by a 
narrow beam of electrons, it promises the 
same nanometre-scale resolution that those 
systems can achieve. 

“This has opened the door to understand- 
ing how light couples to matter in a more 
fundamental way,” says Harry Atwater, 
a physicist at the California Institute of 
Technology in Pasadena, who worked with 


METAMATERIAL TV 


A system of lenses and mirrors makes it possible to use cathodoluminescence, the principle of a 
cathode-ray-tube television, to map complex nanoscale structures within engineered ‘metamaterials’. 


Scanning 
electron beam 


Cathodoluminescence 


— 


Sample 


Polman on cathodoluminescence in the past 
but is not involved with the work on the latest 
technique. 

Key to the system is ultraprecise alignment 
of the sample. The device includes a care- 
fully shaped parabolic mirror that collects 
photons as they emerge from a sample bom- 
barded with electrons. It sends them through a 
system of lenses and a spectrometer to resolve 
the origin and energy of the light (see ‘Meta- 
material TV’). The device is sensitive enough 
to pick up a signal even from materials that 
are barely luminescent, such as metals. Just as 
in an old-fashioned cathode-ray-tube colour 
television, the electron beam scans the sample 
to build up an image line by line. 

Already, the Dutch group, along with col- 
laborators in the United States and Spain, 
has used the technique to tease out how cer- 
tain nanostructures interact with light. In a 
recent paper studying a layered construct of 
silver, glass and silicon, they show that the 
phase velocity of visible light — the speed 
at which the peaks and troughs of the wave 
travel through the material — is so fast that 
it is effectively travelling in a vacuum, the 
explanation for the material's overall refrac- 
tive index of zero (E. J. R. Vesseur et al. Phys. 
Rev. Lett. 110, 013902; 2013). 

The effect had been predicted in such lay- 
ered structures, called metamaterials, but 
observing it required a higher-resolution map 


Parabolic mirror 


Lens 


Flip mirror Filter 


of light emission than earlier techniques could 
produce. The team has also mapped the dis- 
tribution of light in the silicon nanodiscs that 
are used as a coating on solar cells to improve 
efficiency, and in the ultra-small cavities of 
photonic crystals — components of chip-based 
lasers and light-emitting diodes. 

Lukas Novotny, an optical physicist at 
the Swiss Federal Institute of Technology 
in Zurich, says that cathodoluminescence 
could be a useful tool for improving the per- 
formance of light-emitting devices and solar 
cells, because the light-emission maps created 
with the technique reflect the local density of 
electromagnetic states, a quantity that deter- 
mines how well light couples to matter and 
vice versa. “This information is key,’ he says. 

The optics start-up company Delmic, 
based in Delft, the Netherlands, has licensed 
AMOLF'’s cathodoluminescence technique, 
and Polman says that the company will soon 
be selling the devices to materials researchers 
in universities for between US$100,000 and 
$200,000; later it may target the laser, semi- 
conductor and solar-cell industries. He real- 
izes that, by selling the system, he may create 
competitors for his own research. But he says 
that that will be more than made up for by the 
creation of a community of scientists using 
cathodoluminescence outside the ranks of 
geologists. “We want to motivate people to get 
out there and do this,” he says. m 
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Small-scale gold mining is a growing source of toxic mercury emissions. 


ENVIRONMENT 


Tough talk over 
mercury treaty 


Nations debate how to share the costs of cutting emissions. 


BY JANE Qiu 


overnments are on the verge of agreeing 

the first legally binding, global treaty 

to tackle mercury pollution. It aims 

to clean up the legacy of centuries of untram- 

melled emissions of the toxic metal, and to limit 

future contamination from sources as diverse as 
coal-fired power plants and gold mining. 

Delegates from 128 countries are expected 

to meet next week in Geneva, Switzerland, fora 

fifth and final round of treaty negotiations. All 


QUICKSILVER QUANTIFIED 


agree that action is needed urgently to reduce 
mercury emissions, which pose risks to the 
environment and human health. But consen- 
sus on how to achieve that will not come eas- 
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know why they should shoulder the burden of 
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air, soil, rivers and oceans, it is crucial that the 
negotiations deliver “a global treaty that is going 
to be implementable in all the countries’, says 
Fernando Lugris, chairman of the Intergovern- 
mental Negotiating Committee of the United 
Nations Environment Programme (UNEP), 
which is coordinating the treaty. 

According to a draft version of UNEP’s 2013 
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a genie that has escaped the bottle,” says David 
Streets, an environmental scientist at Argonne 
National Laboratory in Illinois. “It takes dec- 
ades or even centuries to get it fixed” 

The UNEP report points out that mercury 
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oceans have doubled in the past century; top 
marine predators in the Arctic Ocean contain 
up to 12 times more mercury than in pre- 
industrial times, and contaminated seafood is 
a significant source of human exposure. The 
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cury as one of the top ten chemicals of major 
public health concern — it can damage the 
brain and is a particular hazard for fetuses. 

Streets estimates that human activity has 
released a grand total of about 350,000 tonnes 
of mercury, with roughly 40% of those emis- 
sions occurring before 1850 (see ‘Quicksilver 
quantified’). Much of the rest came from silver 
and gold mining in the late nineteenth century, 
when mercury was used to extract precious 
metals. However, emissions have soared in 
recent decades, propelled by small-scale, or arti- 
sanal, gold mining in Africa and Latin America, 
and rapid industrialization in Asia (D. G. Streets 
et al. Environ. Sci. Technol. 45, 10485-10491; 
2011). China is now the biggest emitter, contrib- 
uting 30% of global anthropogenic emissions. 

Cleaning up industrial processes, for exam- 
ple by capturing mercury released from burn- 
ing coal, involves costly technology. In the 
negotiations, “the biggest contention is who 


Anthropogenic mercury emissions are rising in Asia, but pollution more than a century old persists globally, raising questions about who should pay to clean up the metal. 
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should pay”, says Ludovic Bernaudat, a 
mercury expert at the United Nations Indus- 
trial Development Organization in Vienna. 
“There are a lot of tensions right now.’ 
Before the latest round of negotiations, 
in June 2012, the discussions aimed at an 
agreement for all countries to cap mercury 
emissions. But developing nations such as 
China and India — second only to China as 
an emitter — were adamant that this would 
be unfair unless developed nations helped 
with the cost and technologies. Common 
measures for controlling air pollution have 
the potential to reduce mercury emissions 
from coal plants by about 36%, “but to go 
further youd need specific mercury-control 
technologies that can remove 90% of emis- 
sions, which are only available in developed 
countries’, says Wang Shuxiao, an environ- 
ment scientist at Tsinghua University in Bei- 
jing, who is part of the Chinese delegation. 
Developed nations seem unlikely to com- 
mit to funding the transfer of such technolo- 
gies. Negotiators may settle on an agreement 
that requires countries to set national targets 
that they can meet with the best mercury- 
control measures available to them, and to 
beef up monitoring programmes. UNEP 
predicts that such measures could reduce 
emissions in industrial regions by 25% by 
2020, compared with an increase of up to 
25% under a business-as-usual scenario. 
The treaty also aims to limit emissions 
from artisanal gold mining, which is largely 
unregulated. Miners soak crushed ore in 
mercury to form an amalgam that leaves 
impurities behind; heating the amalgam 
frees the gold, but releases mercury into the 
air. “Most of them are unaware of the health 
hazards of mercury vapour and nobody 
wears a mask,” says Nicola Pirrone, direc- 
tor of the Institute of Atmospheric Pollu- 
tion Research in Rome. The treaty is likely 
to recommend that countries register and 
monitor mining, and will encourage tech- 
nologies that capture mercury vapour or 
use jets of water and air to separate gold 
from ores (see Nature 486, 306-307; 2012). 
In late February, at the Global Ministerial 
Environment Forum in Nairobi, UNEP’s 
governing council will debate the draft 
treaty that will emerge from next week’s 
meeting. UNEP expects countries to ratify 
the treaty later this year. Even if the treaty 
does not set binding caps, “it should still 
build enough momentum for countries 
to commit to serious efforts to tackle the 
problem’; says John Munthe, an environ- 
ment-policy researcher at the Swedish Envi- 
ronmental Research Institute in Stockholm. 


“There are plenty of 
> NATURE.COM low-hanging fruits 
Read more about that could make a big 
pollutioncausedby difference in reduc- 
gold mining: ing global mercury 
go.nature.com/wlargr emissions.” 
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Safety of induced 
stem cells gets a boost 


Fears of immune response have been overestimated. 


BY MONYA BAKER 


paper published in Nature today’ could 
A™ a cloud over the hopes of turn- 

ing a patient's own cells into perfectly 
matched replacement tissues. 

Scientists first reported in 2007 that a 
person's cells could be reprogrammed to an 
embryo-like state, and so could form any type 
of cell in the body. Medical researchers imme- 
diately imagined using these ‘induced pluripo- 
tent stem (iPS) cells’ to create an endless supply 
of genetically matched replacement tissues to 
treat a range of diseases: fresh pancreatic tissue 
for diabetics, for example, or new nerve cells 
for people with Parkinsons. 

The strategy also seemed to offer a way 
around the ethical complexities of using stem 
cells derived from human embryos. But then 
came the worries about possible side effects. 
Particularly bad news came from a 2011 study” 
showing that iPS cells provoked immune 
responses when injected into the mice from 
which they had been derived, casting doubt 
over one of the key advantages of the cells. 

The latest Nature study’ rejects that conclu- 
sion. Masumi Abe, a geneticist at the National 
Institute of Radiological Sciences in Chiba, 
Japan, and his team took iPS cells derived from 
mice and injected them back into the animals. 
For comparison, they injected other mice with 
embryonic stem (ES) cells. Yet unlike the 2011 
study, which saw iPS cells perform worse than 
ES cells, the team found no differences between 
the immune responses of each group. The 
researchers also transplanted skin and bone- 
marrow cells derived from iPS or ES cells into 
mice and achieved similar success rates between 
the groups. The immune response of both sets 
of tissues is “indistinguishable’, says Abe. 

Konrad Hochedlinger, a stem-cell scientist 
at Massachusetts General Hospital in Boston, 
says that the result will probably “calm people 
down” about iPS cells. “It is definitely reassur- 
ing,” he says. 

The findings follow another positive study 
on iPS cells, published late last year® , Which 
found that the reprogramming process causes 
fewer mutations than previously thought. 
Flora Vaccarino, a neuroscientist at Yale Uni- 
versity in New Haven, Connecticut, and her 
colleagues used high-resolution DNA analysis 
to compare the genomes of iPS cells and the 
adult cells from which they were derived. They 


found that most of the DNA mutations in the 
iPS cells did not arise during reprogramming 
but had been present in the parent cells. 

Yang Xu, a stem-cell scientist at the Univer- 
sity of California, San Diego, and co-author of 
the 2011 study’, says that the new work does 
not dispel all concerns about the immune 
response provoked by iPS cells. 

Xu points out that the skin and bone-marrow 
cells used in the latest study were not grown 
from iPS cells in culture, as they would be for 
clinical use. Instead, the researchers mixed iPS 
cells into early mouse embryos to make ‘chima- 
eric embryos. They then used skin and bone- 
marrow tissues that arose from iPS cells after 

the embryos grew 


“The result will into adult mice for 
probably calm their transplantation 
people down. experiments. It is pos- 
Itis definitely sible, says Xu, that the 


most immunogenic 
cells were rejected 
as the mice developed, which would explain 
why Abe and his colleagues observed a limited 
immune response. Transplanting tissues from 
chimaeric mice is “flawed”, he says. 

Producing chimaeric embryos is a standard 
technique for testing whether mouse iPS cells 
have been fully reprogrammed, says Jakub 
Tolar, a clinician at the University of Minne- 
sota in Minneapolis, but he notes that differ- 
entiating cells in culture outside the body is 
much harder. Tolar, who hopes to use iPS cells 
to treat the childhood skin disease epidermoly- 
sis bullosa, adds that iPS-cell therapies will use 
human cells, which could behave quite differ- 
ently from mouse cells. “It's helpful that they’ve 
done this, but it is absolutely different when 
you go to something that is cultured,” he says. 

Hochedlinger believes that iPS cells are 
just as promising for cell transplantation as 
ES cells, although many issues stand between 
the lab and the clinic. The differences between 
the two kinds of stem cell are minor compared 
with the differences in how individual cell lines 
grow and differentiate in culture, he says. 

“Based on what we know at this time from 
mice,” he says, “iPS cells are as good as ES cells, 
and should be as safe.” = 


reassuring.” 
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‘Rehab’ helps errant researchers 
return to the lab 


As cases of misconduct rise, ethicists test an approach for reforming offenders. 


BY DANIEL CRESSEY 


n 8 January, four health scientists 
() accused of research misconduct are 

expected to file into a room at Saint 
Louis University in Missouri to conduct an 
experiment: can they be rehabilitated? Using 
pseudonyms if they wish, the four will discuss 
what they did wrong, why they did it and how 
they might stop it from happening again — all 
in an effort to see if they can re-enter the 
scientific mainstream. 

With the rapid growth of misconduct cases, 
some scientists are worried that preventative 
training in research ethics might not be enough. 
Nor will it be possible simply to dismiss all 
violators from science. Scientific rehabilitation, 
they say, will have to become a necessary tool 
for research-integrity offices. 

“Sometimes these are very talented research- 
ers, says James DuBois, an ethicist at Saint Louis 
University, who leads the rehab programme, 
called RePAIR (Restoring Professionalism and 
Integrity in Research). “We believe that if we can 
equip them with certain skills, they can return to 
the field as very productive individuals.” 

DuBois could have plenty of customers on the 
way. The US Office of Research Integrity (ORI) 
in Rockville, Maryland, received 419 allega- 
tions of misconduct at institutions in 2012 — 
nearly double the number in 2011, says David 
Wright, director of the ORI. Wright puts much 
of this down to software that has made it easier 
for watchdogs to detect possible plagiarism or 
image manipulation in publications. “We now 
get allegations from all over the world, he says. 

As the pursuit of misconduct increases, there 
has been less focus on how to deal with guilty 
scientists. Lauran Qualkenbush, director of 
the research-integrity office at Northwestern 
University in Chicago, says that there is a gap 
between harsh penalties for misconduct — such 
as bans from receiving government funding — 
and lighter punishments such as taking a short 
online ethics course and getting a letter of rep- 
rimand. “It's not always black and white,” she 
says. “These are people we feel could be valuable 
members of our faculty and community.’ 

DuBois aims to fill that gap. Although 
remedial courses have been available for 
physicians for more than a decade — with 
many returning to medicine to forge successful 
careers — DuBois says that RePAIR is the first 
such programme for researchers. Developed 


James Dubois hopes that his programme will let 
wayward researchers back into the scientific fold. 


with a US$500,000 grant from the National 
Institutes of Health, the three-day course costs 
$3,000 per participant, or half that if host insti- 
tutions sign up as partners for $1,500 a year. 

Day one of the course focuses on ‘self-serving 
biases’ — in which people take credit for success 
but blame external factors for failures — and on 
getting participants to examine the factors that 
have led to them breach the accepted rules of 
research. The second day gets participants to 
talk about what they did wrong, and focuses 
on skills to promote ethical decision-making, 
such as what they could have done differently 
in their own research. On the final day, profes- 
sional management plans are drawn up for each 
participant to try to prevent them falling back 
into bad habits, with follow-up interviews tak- 
ing place in the weeks after the course. If par- 
ticipants complete all of these activities to the 
satisfaction of the RePAIR team, a certificate of 
completion is sent to their institution. 

Last year, DuBois conducted a survey of 
129 US institutions and found that most were 
investigating between three and five cases of 
wrongdoing per year. He says that, so far, around 
six universities, including Northwestern, have 
signed up as partners in the programme. 

Some ethicists are unsure how effective such 
rehab will be. Nicholas Steneck, an ethicist at 


the University of Michigan in Ann Arbor, is 
broadly supportive of the goals of RePAIR, but 
says that he wonders if the money might be bet- 
ter spent on measures that prevent misconduct. 

Although the ORI is not endorsing RePAIR 
specifically, Wright says that his previous job 
as a research-integrity officer at Michigan 
State University in East Lansing has made him 
well aware of the need for such a programme, 
especially for researchers who have committed 
low-level misconduct, such as failing to disclose 
a conflict of interest. “What do you do with 
them to make them whole as members of the 
research community, so they’re not just pariahs 
in their community forever? My view is if you're 
not going to fire somebody, there ought to bea 
way for them to re-establish themselves.” 

Wright says it is too early to say if DuBois and 
colleagues’ programme will work, but thinks 
that it is “a worthy effort and I salute them” 

Scrutiny of misconduct is also growing 
rapidly in Europe. According to Steneck, 
who consults for Epigeum, a London-based 
company that provides training on research 
integrity, that is because the definition of mis- 
conduct is broadening beyond falsification, 
fabrication and plagiarism. ‘Grey area cases 
such as self-plagiarism and duplicate publi- 
cation are increasingly attracting attention, 
he says. However, the extent of misconduct 
is harder to establish in Europe than in the 
United States, because the overarching Euro- 
pean Union research-funding system — as 
well as those of individual countries like the 
United Kingdom — lack an official investiga- 
tory body similar to the ORI. This means that 
responsibility for dealing with misconduct can 
fall solely on individual institutions. 

A session of the RePAIR programme in 
May has already been advertised, and DuBois 
eventually hopes to run the course for up to ten 
people at a time. 

“A lot of people will say to me, “These are 
bad apples — you're not going to reach them,” 
he says. “My sense is that may be true of 10%, 
15%, maybe even 20%, but I do believe that we 
can make a difference in the lives of the vast 
majority.” m 


CORRECTION 

The News Feature ‘Nature’s 10’ (Nature 492, 
335-343; 2012) gave the wrong age for Jun 
Wang. He is 36, not 34. 
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Brian Butterworth is on a crusade 
to understand the number deficit 
called dyscalculia — and to help 
those who have it. 


NUMBER 
GAMES 


BY EWEN CALLAWAY 


journeyed with a film crew into Afghanistan to produce a 

documentary about the fifth anniversary of the Soviet inva- 
sion. The trip took them behind Soviet lines. “We were attacked 
every fucking day by the Russians,” says the colourful Welshman. 
But the real trouble started later, when Moorcraft tried to tally his 
expenses, such as horses and local garb for his crew. Even with a 
calculator, the simple sums took him ten times longer than they 
should have. “It was an absolute nightmare. I spent days and days 
and days.” When he finally sent the bill to an accountant, he had 
not realized that after adding a zero he was claiming millions of 
pounds for a trip that had cost a couple of hundred thousand. “He 
knew I was an honest guy and assumed that it was just a typo.” 

Such mistakes were part of a lifelong pattern for Moorcraft, 
now director of the Centre for Foreign Policy Analysis in London 
and the author of more than a dozen books. He hasn't changed 
his phone number or PIN in years for fear that he would never 
remember new ones, and when working for Britain’s Ministry 
of Defence he put subordinates in charge of remembering safe 
codes. In 2003, a mistaken phone number — one of hundreds 
before it — lost him a girlfriend who was convinced he was out 
gallivanting. That finally convinced him to seek an explanation. 

At the suggestion ofa friend who teaches children with learning 
disabilities, Moorcraft contacted Brian Butterworth, a cognitive 
neuroscientist at University College London who studies numeri- 
cal cognition. After conducting some tests, Butterworth concluded 
that Moorcraft was “a disaster at arithmetic” and diagnosed him 
with dyscalculia, a little-known learning disability sometimes called 
number blindness and likened to dyslexia for maths. Researchers 
estimate that as much as 7% of the population has dyscalculia, 
which is marked by severe difficulties in dealing with numbers 
despite otherwise normal (or, in Moorcraft’s case, probably 
well above normal) intelligence. 

That combination has attracted neuroscientists such as 


| n the mid-1980s, Paul Moorcraft, then a war correspondent, 
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Butterworth, who believe that the disorder illuminates the inner 
workings of the brain’s number sense — the ability to understand 
and manipulate quantities. This sense is every bit as innate as 
vision or hearing, yet scientists disagree over its cognitive and 
neural basis, a debate that dyscalculics may help to settle. 

For Butterworth, scientific curiosity eventually gave way to 
advocacy. “I thought, it’s not enough to just try to identify the 
cause,” he says. In the past decade, he has crusaded to get dyscal- 
culia recognized — by parents, teachers, politicians and anyone 
who will listen. And he is using his scientific insights into the 
condition to help dyscalculic children. “What's the point of telling 
someone they have dyscalculia if you can't help them?” he says. 


FINDING THE NUMBER 

Christopher, a chatty nine-year-old in a rumpled blue sweatshirt 
and white polo shirt, sits beside Patricia Babtie, a teacher who 
specializes in dyscalculia and tutors children across Greater 
London. Ona sturdy-looking laptop, Christopher (not his real 
name) is navigating Number Sense, a suite of educational com- 
puter games designed by Butterworth and his colleague Diana 
Laurillard at the Institute of Education in London. 

By developing treatments for dyscalculia, Butterworth hoped to 
test competing theories about the cognitive basis of numeracy. If 
as he believes, dyscalculia is at heart a deficiency of basic number 
sense and not of memory, attention or language, as others have 
proposed, then nurturing the roots of number sense should help 
dyscalculics such as Christopher. “It may be the case that what 
these kids need is just much more practice than the rest of us,” But- 
terworth says. Christopher’s school is one of several in London 
working with the software, and students in Cuba, Singapore and 
elsewhere will also soon start using it. 

Christopher starts with a game involv- 
ing a number line — a spatial representa- 
tion that scientists believe is key to number 
sense. “What is the number that is right in 
the middle between 200 and 800? Do you 
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HOW FAST CAN 
YOU COUNT b> 
THESE DOTS? 


Children with dyscalculia take 
longer than other children 

to count dot patterns. Most 
instantly recognize patterns 
of up to four dots, whereas 
dyscalculics tend to count the 
dots one by one. 


know it?” Babtie asks. Christopher shrugs. “Think of any number 
that is bigger than 200 and smaller than 800 and put it in this box. 
It could be 201,’ she says. He enters 200, and Babtie reminds him 
that the number needs to be greater than 200. He selects 210, prob- 
ably mistaking it for 201. A classic sign of dyscalculia is difficulty in 
grasping the place-value system, according to Babtie. “That will do 
fine,’ she says. A soft computer voice tells Christopher to “find the 
number and click it” The game involves zooming in and zooming 
out to rescale the number line, and Christopher talks through each 
move — a strategy that Babtie encourages — but it takes him more 
than a minute to locate 210. His classmates, meanwhile, are learning 
to multiply two-digit numbers. 

Some children at Christopher’s school have more profound 
numeracy problems. One nine-year-old classmate says that she 
doesn't know if 50 is greater or less than 100; another the same age 
confuses four dots for five and routinely tots up small sums on his 
fingers, acommon strategy for dyscalculics. 

“OK, time to stop. We'll do some more of this another day,” 
Babtie says to Christopher, after 20 frustrating minutes. It is clear 
he would rather be back in class than here in this room practising 
a skill his classmates learned years ago. 


HOW MANY ARE THERE? 

Butterworth, now 69, straddles the academic and public spheres. 
A fellow of the British Academy, the United Kingdom's national 
body for the humanities and social sciences, he made his name 
probing obscure speech and language disorders and has appeared 
in the British media for many years. In a Sunday Times article in 
1984, for example, Butterworth claimed that the speech patterns of 
former US president Ronald Reagan indicated Alzheimer’s disease. 
Reagan was diagnosed with the condition a decade later. 

In the late 1980s Butterworth studied a stroke patient who 
would change the course of his professional life. The woman, a 
59-year-old former hotel manager from Italy, fared about average 
on verbal IQ tests and had a good memory, but when Butterworth’s 
Italian colleagues asked her to count, she would start, “uno, due, 


tre, quattro, and then stall. “Miei matematica finisce alle quattro” 
— my mathematics stops at four — the woman, known as CG, 
would tell them’. 

Neurologists had presented case studies of ‘acalculic’ patients 
such as CG from the early twentieth century onwards, if not 
before, but “people hadn't thought a lot about the specific brain 
areas involved in calculation’, says Butterworth. Brain scans of CG 
revealed a lesion in the parietal lobe, a part of the brain just above 
the ears. Later, Butterworth found another patient with the oppo- 
site pattern of disability: neurodegeneration had robbed him of 
speech, language and much of his knowledge, save for the ability 
to do intricate calculations. Butterworth grew more certain that 
numerical abilities relied on specialized brain networks, and not 
only on those supporting general intelligence, as many scientists 
believed at the time. 

Genetics and the vagaries of brain development disrupted these 
networks in dyscalculics, Butterworth proposed. And Moorcraft 
was one of Butterworth’s most revelatory subjects because of the 
great disparity in his abilities in different domains. Butterworth 
and his colleagues also tested 31 eight- and nine-year-old children 
who were near the bottom of their class in mathematics but did 
well enough in other subjects. Compared with normal children 
and those with dyslexia, the dyscalculic children struggled on 
almost every numerical task, yet were average on tests of reading 
comprehension, memory and IQ). 

The study confirmed for Butterworth that developmental forms 
of dyscalculia are the result of basic problems in comprehending 
numbers and not in other cognitive faculties. But determining 
exactly what those problems are would prove challenging. 

Like nearly all human cognitive abilities, number sense is evo- 
lutionarily ancient — tens if not hundreds of millions of years old. 
Studies of chimpanzees, monkeys, newborn chicks, salamanders 
and even honeybees point to two parallel systems for represent- 
ing quantities. One, called the approximate number sense, distin- 
guishes larger quantities from smaller ones, be they dots flashing 
on ascreen or fruits in a tree. Studies on monkeys reveal that 
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Brian Butterworth hopes that his number games will help dyscalculic children — and open a window on how the brain processes numbers. 


certain neurons in a specific fold of the parietal lobe fire more 
vigorously in response to increasingly higher numbers’. A second 
ancient number system allows humans and many other animals 
to instantly and precisely recognize small quantities, up to four. 
Primate studies show that individual neurons within the same 
fold, called the intraparietal sulcus, seem tuned to particular 
quantities, such that when a monkey is performing a task that 
involves numbers, one neuron will fire for the number 1, a 
different one will fire for 2 and so on’. 

People who are poor at distinguishing approximate quanti- 
ties do badly in maths, suggesting that the approximate-number 
system is crucial*, And some work shows that dyscalculics are 
poor at recognizing small numbers, suggesting that this ability is 
also fundamental to numeracy’. Moreover, scans of people with 
dyscalculia suggest that their intraparietal sulci are less active 
when processing numbers’ and less connected with the rest of 
the brain’ compared with numerate children and adults. 

Yet Butterworth views such results as consequences, not causes, 
of the poor numerical abilities that characterize dyscalculia. He 
argues that another cognitive capacity is even more fundamental 
to number sense. He calls this ‘numerosity coding’: the under- 
standing that things have a precise quantity associated with them, 
and that adding or taking things away alters that quantity. 

But Stanislas Dehaene, a cognitive neuroscientist who studies 
numerical cognition at INSERM, France’s national institute for 
research on medicine and health, near Paris, sees number sense as 
being supported by a broader set of cognitive features. Approxima- 
tion and a sense of small numbers, while critical, are not enough 
for humans to precisely grasp large numbers, he says. Language, 
he argues, empowers humans to integrate the two number systems 
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WHICH NUMBER IS LARGER? 


These tests are used to determine whether a person's problems with 
numbers are due to dyscalculia or to other cognitive deficits. 


3 8 7, 7 


WHICH IS TALLER? WHICH IS GREATER? 


People with dyscalculia Dyscalculics take longer and are less 
answer this question just accurate at answering this question. 
as quickly and accurately They have even more trouble when 
as people without a the difference between the two 
learning disability. numbers is small. 


— giving them the ability to intuitively distinguish, say, 11,437 
from 11,436. Butterworth’s concept of numerosity coding may be 
an important part of number sense, says Dehaene, but there is still 
much to learn about it — for instance, whether it is present in other 
animals or in children from a very early age. 

One of Butterworth’s favourite papers is titled ‘Six does not just 
mean a lot: preschoolers see number words as specific”®. In it, the 
developmental psychologist Barbara Sarnecka, now at the Univer- 
sity of California, Irvine, and Susan Gelman, at the University of 
Michigan in Ann Arbor, showed that young children who could 
not yet count past two nonetheless understood that adding pen- 
nies to a bowl containing six somehow altered its number, even 
if the children couldnt say exactly how. If numerosity coding is 
fundamental, it predicts that dyscalculics such as Moorcraft or 
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Christopher struggle to enumerate and manipulate all numbers, 
large and small. Butterworth hopes that, by honing this ability, 
the Number Sense games will help support his research ideas. 

Three months on, Christopher seems to be faring better at the 
number-line game, going so quickly that Babtie asks him to slow 
down and explain his reasoning for each move. Babtie says that 
dyscalculic children tend to learn much more quickly when they 
talk through what they do. She also believes that Christopher's 
maths anxiety, a near-universal trait of child and adult dyscalcu- 
lics, is fading. 

He moves on to a Tetris-like game called 
Numberbonds, in which bars of different 
lengths fall down the screen, and he is asked 
to select a block of the correct size to fill out 
arow. This emphasizes spatial relationships, 
which some dyscalculics also struggle with. 
The blocks move too quickly at first, frustrat- 
ing Christopher, but he soon gets the hang of 
it, and when Babtie suggests he stop for the 
day, he begs for ten more minutes. 

The Number Sense games, including a 
snazzy-looking iPhone version of Number- 
bonds, are intended to nurture the abilities 
that, Butterworth contends, are the root of 
numerical cognition and the core deficit of 
dyscalculia — manipulating precise quan- 
tities. In a game called Dots to Track, for 
example, children must ascribe an Arabic 
numeral to a pattern of dots, similar to those on dice. When they 
enter the wrong value — and they often do — the game asks the 
children to add or remove dots to achieve the correct answer. 

As the summer holidays approach, Babtie is worried that Chris- 
topher and the other students she has been working with won't 
practise the games at home, returning in the autumn the worse 
for it. But in early October when school is back, Christopher 
announces that he will challenge himself with a number line that 
stretches from 950 to 9,000, “if you'll allow me’, he adds. At first he 
flounders, but quickly starts to understand the game and locates a 
string of four-digit numbers, beaming with each correct response. 

Other students are improving more slowly, but it is not easy to 
say why. Dyslexia, attention deficit hyperactivity disorder and 
autism spectrum disorder are common among dyscalculics, and 
it can be difficult to untangle these problems, says Babtie. The 
nine-year-old who counted on his fingers nine months ago can 
now deal with numbers below 6, but still struggles to distinguish 
9 from 10. Yet with the right practice and attention from teach- 
ers and parents, dyscalculic children can thrive, says Babtie, who 
emphasizes that computer games are a supplement, nota replace- 
ment, for one-on-one tutoring. 

Butterworth knows that it will take a controlled evaluation of 
Number Sense before he can say if the game genuinely improves 
numeracy in dyscalculic children. Small studies of other computer- 
based interventions hint that they might help. Dehaene reported 
in 2009 that Number Race, a game his group developed, mod- 
estly improved the ability of 15 dyscalculic kindergarten children 
to discern the larger of two numbers, but that it had no effect on 
their arithmetic or counting’. Meanwhile, a Swiss team reported 
in 2011 that a game that involves placing a spaceship on a number 
line helped eight- to ten-year-old dyscalculics with arithmetic. The 
researchers also studied the children in an fMRI scanner during a 
task that involved arranging numbers. They found that one month 
after training, the children showed increased activation in the 
intraparietal sulcus and reduced neural activation elsewhere in the 
parietal lobes — a hint that their improvements in arithmetic were 
related to changes involving brain areas that respond to number"”. 

Butterworth hopes to monitor the brains of students such as 


SCHOOL. 


A MATHS 


“EVERY DAY 
THEY GO TO 


DAY THERE’S 


CLASS. EVERY 
DAY THEY’RE 
SHOWN UP TO BE 
INCOMPETENT.” 


Christopher as they practise Number Sense, to see if their pari- 
etal lobes are indeed changing. But he has been turned down by 
every funding source he has applied to. Although dyscalculia, 
like other learning disabilities, takes a toll on productivity (one 
report estimated that low numeracy costs the United Kingdom 
£2.4 billion (US$4 billion) per year, mostly in lost wages) it 
doesn’t attract much attention or money. In the United States, 
for example, the National Institutes of Health spent $2 million 
studying dyscalculia between 2000 and 2011, compared with 
more than $107 million on dyslexia. 

Butterworth’s team now has tentative 
plans to evaluate its software with research- 
ers at the Cuban Neurosciences Center and 
the University of Pedagogical Sciences in 
Havana next year, and the group is also 
placing the game in other countries, includ- 
ing China and Singapore. “The Cubans, 
curiously, are putting money into this, even 
though they've got very little,’ Butterworth 
says, commending the strength of the 
country’s education system. 

Although an emeritus professor, tech- 
nically retired, Butterworth continues to 
research the neurodevelopmental roots of 
number sense, recently showing that gup- 
pies", like humans, possess approximate 
and precise number systems, and that 
dyscalculic adults have no more trouble 
telling the time than numerate people”. 

He hopes that Number Sense — if it can improve dyscalculia 
— will help him in the academic debate over the cognitive basis 
of numeracy. But Dehaene, probably his most fervent opponent 
in that debate, isn’t counting on classroom computer games to 
resolve it. His Number Race game and its successor, Number 
Catcher, incorporate a multitude of numerical skills, so even if 
the game works, it won't address the theoretical differences about 
which skills are most essential to number sense or most com- 
promised in dyscalculics. “I quickly realized that the interest of 
the children was to have a fun game full of ideas and variety, and 
that was not very compatible with an analytic approach,’ he says. 

Butterworth, too, says that he is ultimately more motivated by 
helping children. In the course of his studies, he was struck that 
children “were very, very distressed by being bad at maths. So 
every day they would go to school, every day there’s a maths class, 
every day they're shown up to be incompetent in a way other kids 
in their class are not’, he says. 

Moorcraft can commiserate. When he occasionally meets dys- 
calculic children, he tells them that he, too, counts with his fingers 
under the table, that they have nothing to be embarrassed about and 
that, with the practice that he never got, they can get up to speed. 

Moorcraft is also completing a book on dyscalculia with one of 
Butterworth’s postdocs. “I have written an introduction,’ he says. 
“ just hope the chapters are in the right order. m 


EVERY 


Ewen Callaway writes for Nature from London. 
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BY NICOLA JONES 


massive chunk of the Mauna Loa 

volcano cracked away from Hawaii 
and slid into the sea, launching a wave that rose as high as the Eiffel 
tower up the slopes of a nearby island. That mega-tsunami was not an 
isolated incident: the past 40,000 years have seen at least ten gigantic 
landslides of more than 100 cubic kilometres in the North Atlantic ocean 
alone, each capable of producing waves tens to hundreds of metres high. 
Another is bound to happen sometime — although whether it will strike 
tomorrow or 10,000 years from now is anyone's guess. 

This week, the World Economic Forum published its 2013 global 
risks report, which includes a section, produced in collaboration with 
Nature, on X factors: low-probability, high-impact risks resulting 
mainly from human activity (see go.nature.com/outhzr). But the natu- 
ral world holds unpredictable threats as well. The geologic record is 
peppered with evidence of rare, monstrous disasters, ranging from 
asteroid impacts to supervolcanoes to y-ray bursts. Nature looks into 
some of the life-shattering events that Earth and the broader Universe 
could throw our way. 


0 ne hundred thousand years ago, a 


DEATH BY VOLCANO 

Earth is now in the middle ofa flare-up of supervolcanic activity’. Over 
the past 13.5 million years, no fewer than 19 giant eruptions have each 
spewed more than 1,000 cubic kilometres of rock — enough to coat 
an entire continent in a few centimetres of ash and push the planet 
into ‘nuclear winter’. One of the most recent such eruptions, of Toba 
in Indonesia 74,000 years ago, was such a catastrophic event that some 
scientists have blamed it for starting the last ice age and slashing the 
human population to about 10,000 people. One estimate’ suggests that 
there is a 1% chance ofa super-eruption in the next 460-7,200 years. 

The four youngest, most active supervolcanic systems in the world 
are Toba, Campi Flegrei in Italy, Yellowstone in the northwestern United 
States and Taupo in New Zealand. All four systems are being monitored 
for groundswell and seismic swarms — clusters of small earthquakes that 
can signal moving magma — and all occasionally show these warning 
signs. But no one knows whether the result of each flare-up will be a small 
squirt of steam or — much more hazardous — a mega-eruption of lava. “If 
something were brewing, we would get warning hours, days and months 
ahead,’ says Shan de Silva, a volcanologist at Oregon State University in 
Corvallis. “But how big it's going to be, we don't have a handle on” 

To help answer these questions, scientists are now drilling into the 
heart of one of the top contenders for the next blow-up: the Campi Flegrei 
caldera, a crater that is 13 kilometres wide and includes the city of Naples. 
Since 1969, the ground at Campi Flegrei has bulged upwards by as much 
as 3.5 metres, and researchers are eager to find out whether the culprit 
is underground steam or a pool of magma. Previous bouts of volcanic 
activity in the caldera came after the ground surface had swelled up by 
several metres or more’, and researchers think that major activity could 
occur within the next few decades or centuries. To investigate the risk, 
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scientists at Campi Flegrei plan to drill more than 3 kilometres into the 
crater, despite concerns from some researchers that the drilling could 
trigger earthquakes or an explosion. 

One goal is to look at the magma pool beneath the crater: the 
shallower and more molten it is, the greater the chances of a super- 
eruption. Characterizing such pools through seismic studies is hard, 
and the range of error is huge. “We really are groping in the dark,” says 
de Silva. Scientists estimate that 10-30% of the magma under 
Yellowstone, for example, is liquid — shy of the 50% thought to 
be needed for super-eruption. But pockets of molten magma in 
the chamber could still cause eruptions several-fold larger than 
the 1980 blast from Mount St Helens in Washington state, warns 
Jacob Lowenstern, head of the Yellowstone Volcano Observatory for 
the US Geological Survey in Menlo Park, California. 

The effort to drill into Campi Flegrei and measure features 
such as temperature and rock permeability should help research- 
ers to interpret seismic-imaging studies of magma pools, says 
Lowenstern. “If we want to be able to successfully image Earth, we 
occasionally need to make a few strategic incisions into the patient,” 
he says. As for the dangers of drilling, Lowenstern is convinced 
that the project will have minimal impact. “It’s like a pinprick on 
an elephant,’ he says. The Campi Flegrei team finished an initial 
500-metre test well in December 2012 without incident. And seismol- 
ogists safely drilled a hole of similar size into the Long Valley caldera 
in California — a supervolcano site that erupted 760,000 years ago and 
holds the same killer potential as Yellowstone. 

Until more is learned about these systems, societies must accept 
that the threat of a super-eruption is real, yet remote. Lowenstern says 
that although the chances of one happening this year are tiny, “it is 
theoretically possible”. 


DEATH BY FUNGUS 

Although viruses and bacteria grab more attention, fungi are the planet’s 
biggest killers. Of all the pathogens being tracked, fungi have caused 
more than 70% of the recorded global and regional extinctions’, and 
now threaten amphibians, bats and bees. The Irish potato famine in the 
1840s showed just how devastating such pathogens can be. Phytophthora 
infestans (an organism similar to, and often grouped with, fungi) wiped 
out as much as three-quarters of the potato crop in Ireland and led to 
the death of one million people. 

Potato blight is still a threat: 13_A2, a highly aggressive strain of 
P. infestans, is now rampant in Europe and North Africa. Across the 
globe, Phytophthora causes some US$6.7 billion in annual damages, 
according to a 2009 estimate*. Sarah Gurr, a plant pathologist at the 
University of Oxford, UK, estimates that the worst theoretical potato 
infestation would deprive 1.3 billion people of food each year. Other major 
staple crops face similar threats, such as rice blast (Magnaporthe oryzae), 
corn smut (Ustilago maydis), soya bean rust (Phakopsora pachyrhizi) and 
wheat stem rust (Puccinia graminis). The stem-rust superstrain Ug99 has 
in recent years slashed yields in parts of Africa by as much as 80%. 

If all five crop staples were hit with fungal outbreaks at the same 
time, more than 60% of the world’s population could go hungry, says 
Gurr. “That’s apocalyptic’, but unlikely, she says — “more of a James 
Bond movie”. David Hughes, a zoologist at Pennsylvania State Univer- 

sity in University Park, adds that terrorists could 
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unproductive farms to the cities and cleared more rainforest. “If you 
wanted to destabilize the world, you could easily introduce rubber blight 
into southeast Asia,” he says, which would trigger a chain reaction of 
economic and political effects. 

Modern agriculture has exacerbated societies’ vulnerability by 
encouraging farmers to plant the same strains of high-yield crops, 
limiting the variety of resistance genes among the plants, says Gurr. 
“We've skewed the arms race in favour of the pathogen,’ she says. “That's 
why were on the brink of disaster.” 

Researchers estimate that there are 
1.5 million to 5 million species of fungi 
in the world, but only 100,000 have been 
identified. Reports of new types of fungal 
infection in plants and animals have risen 
nearly tenfold since 1995 (ref. 3). Gurr sug- 
gests that climate change might be a culprit. 

Humans have cause for concern as well. 
In the past decade, a tropical fungus called 
Cryptococcus gattii has adapted to thrive 
in cooler climes and invaded the forests 
of North America’s Pacific Northwest. By 
2010, it had infected some 280 people, dozens of whom died. Although 
fungi are not spread as easily from person to person as viruses, for exam- 
ple, and anti-fungal agents can effectively tackle most infections, there 
are still reasons to worry. Fungi continue to evolve, and once they are 
established in an ecosystem, they can be almost impossible to wipe out. 

Given these trends, experts say that fungi have not received enough 
attention from researchers and governments. “I'd be very surprised if an 
abrupt fungal infection killed a large swathe of people. But it’s not impos- 
sible,” says Matthew Fisher, an emerging-disease researcher at Imperial 
College London. “Complacency is not a recommended course of action.” 


DEATH FROM ABOVE 

The heavens hold plenty of threats. The Sun occasionally launches 
outsize solar flares, which fry electricity grids by generating intense 
currents in wires. The most recent solar megastorm, in 1859, sparked 
fires in telegraph offices; today, a similarly sized storm would knock out 
satellites and shut down power grids for months or longer. That could 
cause trillions of dollars in economic damage. 

A solar flare some 20 times larger than that may have hit Earth in 774, 
according to Adrian Melott, a cosmologist at the University of Kansas in 
Lawrence, and Brian Thomas, an astrophysicist at Washburn University 
in Topeka, Kansas. “That’s not an extinction event,” says Melott, “but for 
a technological civilization, it could kill hundreds of millions of people 
and set us back 150 years.” Fortunately, there are ways to mitigate this 
worst-case scenario should it occur: engineers can protect the grid with 
fail-safes or by turning off the power in the face of an incoming blast. 

Next up the scale of disaster magnitude is a large comet or asteroid 
strike. Sixty-five million years ago, an asteroid 10 kilometres wide hit 
Earth and triggered the end-Cretaceous mass extinction; 2-kilometre 
rocks, thought to be capable of causing extinctions on a smaller scale, 
smack the planet once or twice every million years. Astronomers are hard 
at work tallying and tracking asteroids in Earth’s vicinity, and scientists 
are investigating ways to divert any real threats that might materialize. 

A far rarer danger — and one that could not be avoided — is the blast 
of radiation from a nearby y-ray burst. Perhaps the most frightening 
of these celestial explosions is the ‘short-hard’ y-ray burst, caused by 
the violent merger of two black holes, two neutron stars or a combina- 
tion. If one such blast were directed at Earth from within 200 parsecs 
away (less than 1% of the distance across the Milky Way), it would zap 
the globe with enough high-energy photons to wipe out 30% of the 
atmosphere’s protective ozone layer for nearly a decade’. That sort of 
event — expected once every 300 million years or so — would double 
the amount of ultraviolet (UV) light reaching the ground and scorch 
phytoplankton, which make up the base of the ocean's food web. 

Astronomers have no way of knowing whether such a rare event is 
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imminent. Neutron stars are small and dark, so there is no catalogue 
of those within striking distance. “We wouldnt see it coming,” says 
Thomas. In as-yet-unpublished work, he estimates that such an event 
could cause a 60% increase in UV damage to crops, with up to 60% 
reduction in crop yields. 

From a distance of about 2,000 parsecs, ‘long-soft’ y-ray bursts — which 
result from the collapse of massive stars — could also cause extinctions. 
But these events are rarer than short-hard bursts, and easier to spot in 
advance because they come from larger, 
brighter stars. The two-star system WR 104 
is some 2,500 parsecs away from Earth, and 
is far enough along in its life cycle that it is 
expected to explode some time in the next 
few hundred thousand years — although the 
beam from the burst is unlikely to hit Earth. 

It is possible that a y-ray blast has hit the 
planet before. Melott, Thomas and their 
colleagues have suggested that the mass 
extinction at the end of the Ordovician 
period, 440 million years ago, could have 
been triggered by a y-ray blast that wiped 
out some species through UV exposure and killed off others by creat- 
ing a sunlight-blocking haze of nitrogen dioxide®. This would explain 
why some species went extinct before the globe cooled during that 
period, and it fits the extinction pattern, which shows that among marine 
organisms, the greatest toll was on plankton and other life in the upper 
part of the ocean. 

Thomas says that none of these potential disasters is keeping him up 
at night. He does, however, “have some canned food in the basement” — 
a prudent backup in the event of any disaster. 


DEATH BY WATER 

Eight thousand years ago, sediments covering an underwater area 
the size of Scotland slipped from their moorings off the west coast of 
Norway and raced along the sea floor. The Storegga slide triggered a 
tsunami that ran at least 20 metres up the nearby Shetland Islands, and 
probably wiped out some coastal tribes as it clobbered shores around 
northern Europe. The scar it left on the ocean floor stretches nearly 
300 kilometres. “It’s absolutely enormous, and I’m not using the word 
‘enormous lightly,’ says Peter Talling, a sedimentologist at the Univer- 
sity of Southampton, UK, who is leading a project to assess the country’s 
risk of similar slides. 

The United Kingdom is not the only country concerned about giant 
submarine landslides. “There are definitely areas that have potential? 
says Uri ten Brink, a geophysicist at the US Geological Survey in Woods 
Hole, Massachusetts, who conducted a 2008 study of possible sources 
of tsunamis on the US east coast, where some nuclear power plants are 
within striking distance of such waves. “There are far larger piles of sedi- 
ment around today than Storegga ever was,’ ten Brink says, including 
deposits along the coast of southern Alaska and off the Amazon, Niger 
and Nile river deltas. Smaller slides are more probable and can still have 
a huge local impact — and they often strike without warning. In 1998, 
a relatively small (magnitude-7) earthquake triggered an underwater 
slide that launched a 15-metre-high tsunami into Papua New Guinea, 
killing 2,200 people. 

Researchers say that it is hard to quantify the threat of marine slides, 
particularly the giant ones. “There is so little information about events 
that happen so rarely,” says ten Brink. “We just have to learn as much as 
we can.” m SEEEDITORIALP. 134 


Nicola Jones is a freelance writer near Vancouver, Canada. 
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Funding agency 
rewards outputs other than 
papers at last p.159 


How science flows 
through Herman Melville’s 
Moby-Dick p.160 


centre stage p.162 
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puts inventors’ personalities 


US science agencies 
still worried about deferred 
spending cuts p.163 
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The shores of Rodrigo de Freitas Lake in Brazil are littered with fish killed by algal overgrowth in M 
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arch 2000. 


From patterns to predictions 


Truly generic signals warning of tipping points are unlikely to exist, warn Carl Boettiger 
and Alan Hastings, so researchers should study transitions specific to real systems. 


r Vhere has been much talk about tipping 
points over the past few years, and 
about the warning signals that may 

precede them. You could be forgiven for 
thinking that the forecasting of epidemics 
and stock-market crashes is just around the 
corner. But no one has yet managed to use 
the theory on early warning signals to pre- 
dict a natural catastrophe. 

The rewards of bridging the gap between 
the real world and mathematical concep- 
tualizations of catastrophic shifts would 
be vast. Climate scientists might be able to 
foresee major shifts in the ocean currents 
with a rise in global temperatures; ecologists 


could potentially stave off pest outbreaks; 
and policies might be implemented to avert 
the collapse of fisheries’. (A report out this 
week from the World Economic Forum 
outlines other risks facing the world’). But 
for such applications to emerge, research- 
ers should resist the lure of general rules. 
We must instead use all the available data to 
develop tools to study the specific properties 
of real systems. 

The idea that critical transitions should 
be preceded by warning signals dates as far 
back as 1984 (ref. 3). In the past six years 
or so, such signals have been detected in 
various experiments. For instance, when 


yeast (Saccharomyces cerevisiae) is grown 
in chambers, the populations show some 
recovery after each drop in their nutrient 
supply as they adapt to the new conditions*. 
But this ‘resettling’ takes longer as the com- 
munity approaches a crash. Also, the sizes 
of populations across chambers can become 
more variable in space and time’ (see ‘Pre- 
dictors of doom). 

Such warning signals are tantalizingly 
general. Researchers have seen the same 
dynamics — a slower return to equilib- 
rium and increased variance — elsewhere’, 
including in algal overgrowth in lakes, the 
end of ice ages (tracked by measurements 
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Forests in Colorado are being pushed towards collapse by pine-beetle infestations. 


> of the dust and atmospheric gas in ice 
cores) and the onset of epilepsy (signalled 
by a sudden change in the electrical activity 
of the brain). 

However, no ‘one-size-fits-all’ property 
has been found that signals the imminent 
collapse of a complex system. Hypothetical 
systems can flip into different states with- 
out exhibiting either increased variance or 
a slower return to equilibrium®”. And other 
models display exactly the opposite of what 
is expected. For instance, a model based on 
the life-history patterns of animals such as 
salamanders, which spend the juvenile and 
adult stages of their life cycles in different 
habitats, suggests that progressive habitat 
fragmentation can lead to a decrease in the 
variance of local population sizes’. 

Much effort is being dedicated to finding 
‘generic’ warning signals that apply across 
diverse systems. But because the phenom- 
ena identified so far are not universally 
associated with tipping points, nor even 
sure indicators ofa major shift, their predic- 
tive power is uncertain. We believe that in 
most cases, models designed to predict when 
critical transitions will happen, and in what 
circumstances, will need to be guided by — 
and perhaps even generated from — data on 
the specific system of interest. 


IMPROVING PREDICTIONS 
Encouragingly, even a small amount of 
system-specific information can dramati- 
cally improve researchers’ ability to predict 
a collapse in simulations. For instance, by 
incorporating basic knowledge about the 
birth and death of North Atlantic cod into 
their models, ecologists were able to double 
the accuracy with which they predicted an 
approaching collapse of the population®. 
Also encouraging is that remote-sensing 


tools are flooding researchers with gigabytes 
of data’. Satellite images, for instance, pro- 
vide detailed information about the health 
(and in some cases, species composition) 
of vast swathes of forests, including the 
Amazon rainforest and boreal forests. And 
electronic data loggers, such as continuous- 
monitoring floats, are providing detailed 
information on the physical and biological 
properties of the world’s oceans. 

The multidimensional data that these 
tools provide could lead to modelling 
approaches with the specificity and statisti- 
cal power necessary for real-world applica- 
tions. An as yet untested approach is to plug 
whatever data are available into machine 
learning algorithms to find out what, if 
any, warning signals manifest when certain 
parameters are adjusted. 

Of course, to establish whether a candidate 
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signal is outside the normal range, research- 
ers need to know how the ecosystem, fault 
line, lake or whatever it is behaves under ‘nor- 
mal’ circumstances. Yet controls or replicates 
are rarely available for natural systems. 

One option is to compare the current 
properties of a system with historical or ‘base- 
line’ data. For such baselines to be effective, 
vast amounts of data must have been collected 
well before the system nears the tipping point. 

Such an approach needs exploring; we 
don’t yet know whether it will even be 
possible to obtain baseline data for many 
complex systems. And for ecological sys- 
tems, comparisons will often be confounded 
by undocumented changes in data-collection 
procedures, a switch in management practice 
or a shift in environmental conditions. 

Another way around the lack of controls 
and replicates, and one that has received less 
attention, is to model (rather than measure) 
the expected behaviour of the stable system”. 
In this scenario, various data, including, say, 
real recordings of the density of invertebrates 
ina lake, could be fed into a model. In one 
simulation, a certain parameter, such as the 
number of predators, would be kept constant; 
in the other, it would be steadily increased 
to destabilize the food chain. By comparing 
various properties of the two systems, model- 
lers could then establish the predictive power 
of candidate early warning signals. 

Both theoretical’ and experimental** 
approaches have shown that measurable 
signs of impending transitions do exist’. The 
challenge now is to resist the seductive but 
elusive concept of ‘generic warning signals, 
and focus instead on making predictions 
about real systems using readily available, or 
easily collectable, data. m 
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Value all research products 


Anew funding policy by the US National Science Foundation represents 
a sea-change in how researchers are evaluated, says Heather Piwowar. 


hat a difference a word makes. 
For all new grant applications 
from 14 January, the US National 


Science Foundation (NSF) asks a principal 
investigator to list his or her research “prod- 
ucts” rather than “publications” in the bio- 
graphical sketch section. This means that, 
according to the NSF, a scientist’s worth is 
not dependent solely on publications. Data 
sets, software and other non-traditional 
research products will count too. 

There are more diverse research products 
now than ever before. Scientists are develop- 
ing and releasing better tools to document 
their workflow, check each other’s work 
and share information, from data reposito- 
ries to post-publication discussion systems. 
As it gets easier to publish a wide variety of 
material online, it should also become easy 
to recognize the breadth ofa scientist's intel- 
lectual contributions. 

But one must evaluate whether each prod- 
uct has made an impact on its field — from a 
data set on beetle growth, for instance, to the 
solution to a colleague's research problem 
posted on a question-and-answer website. So 
scientists are developing and assessing alter- 
native metrics, or ‘altmetrics’ — new ways to 
measure engagement with research output. 

The NSF policy change comes at a time 
when around 1 in 40 scholars is active on 
Twitter’, more than 2 million researchers use 
the online reference-sharing tool Mendeley 
(see go.nature.com/x63cwe), and more than 
25,000 blog entries have been written about 
peer-reviewed research papers and indexed 
on the Research Blogging platform’. 

In the next five years, I believe that it will 
become routine to track — and to value — 
citations to an online lab notebook, contri- 
butions to a software library, bookmarks to 
data sets from content-sharing sites such as 
Pinterest and Delicious. In other words, to 
value a wider range of metrics that suggest a 
research product has made a difference. For 
example, my colleagues and I have estimated 
that the data sets added to the US National 
Center for Biotechnology Information’s 
Gene Expression Omnibus in 2007 have 
contributed to more than 1,000 papers™. 
Such attributions continue to accumulate 
for several years after data sets are first made 
publicly available. 

In the long run, the NSF policy change will 
do much more than just reward an investi- 
gator who has authored a popular statistics 
package, for instance. It will change the 


game, because it will alter how scientists 
assess research impact. 

The new NSF policy states: “Acceptable 
products must be citable and accessible 
including but not limited to publications, 
data sets, software, patents, and copyrights.” 
By contrast, previous policies allowed only 
“patents, copyrights and software systems” 
in addition to research publications in the 
biography section ofa proposal, and consid- 
ered their inclusion to bea substitute for the 
main task of listing research papers. 

Still, the status quo is largely unchanged. 
Some types of NSF grant-renewal appli- 

cations continue 


“Altmetrics to request papers 
give afuller alone. Indeed, several 
picture of funders — includ- 
howresearch _ ing the US National 
products have Institutes of Health, 
influenced the Howard Hughes 
conversation, Medical Institute 
thought and and the UK Medical 


Research Council — 
still explicitly ask for 
alist of research papers rather than products. 

Even when applicants are allowed to 
include alternative products in grant applica- 
tions, how will reviewers know if they should 
be impressed? They might have a little bit 
of time to watch a short video on YouTube 
demonstrating a wet-lab technique, or to 
read a Google Plus post describing a compu- 
tational algorithm. But what if the technique 
takes more time to review, or is in an area 
that is outside the reviewer's expertise? Exist- 
ing evaluation mechanisms often fail for 
alternative products — a YouTube video, for 
example, has no journal title to use as a proxy 
for anticipated impact. But it will definitely 
receive a number of downloads, some ‘likes’ 
on Facebook, a few Pinterest bookmarks and 
discussion in blogs. 


behaviour.” 


TRACKING TRENDS 
Many altmetrics have already been gathered 
for a range of research products. For exam- 
ple, the data repositories Dryad and figshare 
track download statistics (figshare is sup- 
ported by Digital Science, which is owned 
by the same parent company as Nature). 
Some repositories, such as the Inter-uni- 
versity Consortium for Political and Social 
Research, provide anonymous demographic 
breakdowns of usage. 

Specific tools have been built to aggregate 
altmetrics across a wide variety of content. 


Altmetric.com (also supported by Digital 
Science) reveals the impact of anything 
with a digital object identifier (DOJ) or other 
standard identifier. It can find mentions of 
a data set in blog posts, tweets and main- 
stream media (see go.nature.com/yche8g). 
The non-profit organization ImpactStory 
(http://impactstory.org), of which Iam a 
co-founder, tracks the impact of articles, 
data sets, software, blog posts, posters and 
lab websites by monitoring citations, blogs, 
tweets, download statistics and attribu- 
tions in research articles, such as mentions 
within methods and acknowledgements’. 
For example, a data set on an outbreak of 
Escherichia coli has received 43 ‘stars’ in the 
GitHub software repository, 18 tweets and 
two mentions in peer-reviewed articles (see 
go.nature.com/dnhdgh). 

Such altmetrics give a fuller picture of how 
research products have influenced conversa- 
tion, thought and behaviour. Tracking them 
is likely to motivate more people to release 
alternative products — scientists say that the 
most important condition for sharing their 
data is ensuring that they receive proper 
credit for it®, 

The shift to valuing broad research impact 
will be more rapid and smooth if more 
funders and institutions explicitly welcome 
evidence of impact. Scientists can speed the 
shift by publishing diverse research products 
in their natural form, rather than shoehorn- 
ing everything into an article format, and 
by tracking and reporting their products’ 
impact. When we, as scientists, build and use 
tools and infrastructure that support open 
dissemination of actionable, accessible and 
auditable metrics, we will be on our way to a 
more useful and nimble scholarly commu- 
nication system. m 


Heather Piwowar is a postdoctoral research 
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PLATE AN. 


Fig.7. BALZENA MYSTICETUS,or COMMON WHALE. 


Fig.2. CUB of the COMMON WHALE 17 Feet long. 


58 Feetlong.. 


The Mouth being: open shows the position of the Whalebone . 


Fig. 3. SA RWAL, Length exclusive of the Tusk 14 Feet. 


This 1820 plate is one of a number by William Scoresby, a whaler’s son who was the first person to draw and describe whales accurately. 


How science inspired Moby-Dick 


Philip Hoare tracks the scientific influences and insights that breach throughout 


Herman Melville’s epic novel. 


ore than a century and a half 
after it was published, Herman 
Melville's Moby-Dick remains a 


key cultural bridge between human history 
and natural history — expressed in the vast 
and ominous shape of the whale. This epic 
novel is a laboratory of literature, created in 
an age before art and science became strictly 
demarcated. 

Melville wrote his book — which drew 
on his own youthful experiences on a whal- 
ing ship — asa tribute to the first period of 
modern whaling in the eighteenth to mid- 
nineteenth centuries, which he claimed to 
be worth US$7 million a year to the fledgling 
United States. At the same time, science was 
undergoing a sea change as the gentleman 
scientists and polymaths of the century’s 
start gave way to more specialized and 
professionalized successors. 

Melville’s attitude to, and use of, science in 
Moby-Dick was in line with the eclectic ethos 
of that period. Drawing on the work of lumi- 
naries such as William Scoresby, Thomas 
Beale, Georges Cuvier and Louis Agassiz, 
Melville used contemporary knowledge of 
natural history — or the lack of it — to his 
own ends. 

Seventeen of the book’s 135 chapters 
focus on whale anatomy or behaviour. Titles 


Moby-Dick include ‘The Sperm 
HERMAN MELVILLE Whale’s Head — Con- 
ce & Brothers: trasted View and “The 


Right Whale’s Head — 
Contrasted View’; such sections lay out the 
whales’ physical structure with a wry mix- 
ture of known facts and arch analogy. (Ina 
witty 2011 essay, marine biologist Harold 
Morowitz speculates on Melville as a “ceta- 
cean gastroenterologist or proctologist”.) 
Melville’s must also be the first, and perhaps 
last, work of literature to feature a chapter 
on zooplankton. 

In the famous Chapter 32, ‘Cetology, Mel- 
ville attempts to categorize species of whale 
as he would catalogue his library, in ‘folios: It 
was a playful gesture that reflected the fluid 
classification of cetacean species at the time. 
In The Natural History of the Sperm Whale 
(1839), Beale notes that the French natural 
historian Bernard Germaine de Lacépéde 
claimed that there were eight species of this 
whale; there are in fact only three: Physe- 
ter macrocephalus; Kogia sima, the dwarf 
sperm whale; and the 
pygmy, K. breviceps. 
Accordingly, Melville 
pronounces earlier 
attempts to describe 
whales “all wrong”, 
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and deploys two contemporary authorities 
to bolster his claim: Scoresby and Beale. 
The Natural History of the Sperm Whale 
was the first attempt to write scientifically 
about this deep-diving, open-ocean whale. 
The result of Beale’s experiences as a surgeon 
on a British whaling ship, the book was full 
of observations on the animal’s anatomy 
and behaviour. Cuvier had claimed that the 
sperm whale struck fear into “all the inhabit- 
ants of the deep’, but Beale knew this whale 
to be “a most timid and inoffensive animal”. 
Equally, Scoresby’s groundbreaking An 
Account of the Arctic Regions (1820) gave 
Melville insight into the other cetacean 
whose numbers were decimated by whal- 
ing: the bowhead (Balaena mysticetus), then 
known as the common whale. Scoresby, the 
son ofa whaler, was a typical polymath of the 
time: hunter, scientist, clergyman and mes- 
merist. In his early career he had received 
encouragement from Joseph Banks, and his 
work set the benchmark for Arctic studies. 
Melville was particularly fascinated by 
Scoresby’s observations of an ancient Inuit 
harpoon embedded in a bowhead’s blub- 
ber. “Who had darted that stone lance?” 
Melville's narrator Ishmael wonders, imag- 
ining (with slight exaggeration) that it had 
been thrown “long before America was 


x 
g 


discovered”. Science indicates that Melville 
may not have been far wrong. In 1999, tests 
on bowheads indicated that these animals 
can live for at least 200 years. 

Of course, the greatest scientific figure of 
the age hovers over Melville. Darwin pub- 
lished On the Origin of Species in 1859, eight 
years after Moby-Dick came out. Melville’s 
sole mention of Darwin is a quote — from 
Darwin’ Voyage of a Naturalist (sic) — in the 
extracts at the start of Moby-Dick. He had 
read Darwin's Voyage of the Beagle (1839) 
in preparation for his own 1854 work, The 
Encantadas or Enchanted Isles — as the 
Galapagos were then known. Melville visited 
the islands in 1841, six years after Darwin's 
fateful landing. Darwin’s recorded observa- 
tion of marine iguanas as “imps of darkness” 
seemed to set the tone for Melville’s meta- 
phoric view of the Galapagos, which he saw 
as “five-and-twenty heaps of cinders ... Inno 
world but a fallen one could such lands exist”. 

Such dark analogies are in line with a man 
who declared all human science to be “but a 
passing fable” — and yet created a fable of 
his own. In Moby-Dick, Ishmael is a perpetu- 
ally sceptical and questioning figure, a man 
attuned to science — a stark contrast to the 
vengeful Ahab and his pursuit of the whale 
that “dismasted” him. As the critic Eric Wil- 
son, in his essay ‘Melville, Darwin, and the 
Great Chain of Being, notes, a “primary 
subtext of Melville’s novel is the passing of 
pre-Darwinian, anthropocentric thought, 
espoused by Ahab, and the inauguration of 

a version of Darwin's 


“Melville’s more ecological evo- 
masterpiece lution, proffered by 
resonates Ishmael”. 

powerfully Melville lived 
with today’s through that process. 
scientific US Transcendentalist 
concerns.” Ralph Waldo Emer- 


son’s essay Nature 
(1836), with its declaration of moral law at 
the heart of the cosmos, was the new philos- 
ophy of Melville’s youth. But as biographer 
Andrew Delbanco points out, Melville read 
A Hazard of New Fortunes (1890), William 
Dean Howells’s Darwinian-inflected view of 
society. Moby-Dick itself has been seen as a 
parody of the Transcendentalists’ ‘back-to- 
nature’ excesses. But Melville does more than 
lambast philosophy or use science as interior 
decoration. He achieved a marvellous syn- 
thesis of his own poetic and philosophical 
impulse with the increasingly science-aware 
ethos of his age. And he did so with a sense 
of black humour that transcended Tran- 
scendentalism to prove that nature — and 
its science — was much stranger and more 
wonderful than they had imagined. 
Moby-Dick failed to make any impact in 
Melville's lifetime, and he died forgotten in 
1891. But his spirit of enquiry and experi- 
ment stood him in good stead as far as > 


Books in brief 


— The Myth of Martyrdom: What Really Drives Suicide Bombers, 


THE MYTH Rampage Shooters, and Other Self-Destructive Killers 
Adam Lankford PALGRAVE MACMILLAN 272 pp. £16.99 (2013) 
MARTYRDOM Are suicide bombers psychologically normal? Many psychologists, 
including experts ‘diagnosing’ the hijackers responsible for the 
& | 11 September 2001 terrorist attacks, view them as just that, albeit 
"12 exercised by a powertul sense of justice. Adam Lankford begs to 
=e H differ. Self-destructive killers, he says, are already primed for suicide 
“ur — so depressed, addicted or brutalized that it is relatively easy to tip 
7 them over the edge. A criminal-justice specialist, Lankford presents 


compelling, well-synthesized evidence for his case. 


The White Planet: The Evolution and Future of Our Frozen World 
Jean Jouzel, Claude Lorius and Dominique Raynaud. Translated by 
Teresa Lavender Fagan PRINCETON UNIV. PRESS 316 pp. $29.95 (2013) 
Ice in all its chill Earthly manifestations has drawn thousands of 
research scientists into the white deserts of the world. Now, three 
pioneers of ice-core science — Jean Jouzel, Claude Lorius and 


“hen dtey Dominique Raynaud — reveal key facets of the cryosphere in a new 
a al translation of their sweeping overview. Moving from exploration 
ia j and early science, they delve into the ice ‘archives’ and findings 
? on climate ancient and current, the rise of pollution and more. A 


nuanced and thorough look at climate change and its implications. 


The Enlightenment Vision: Science, Reason, and the Promise of a 


a Better Future 
ENLIGHTENMENT Stuart Jordan PROMETHEUS 295 pp. $26 (2013) 
VISION Physicist Stuart Jordan scrutinizes the afterglow of that scientific 
~ big bang, the Enlightenment of the seventeenth and eighteenth 
| centuries. Aspects of today’s culture — medicine, scientific outlook, 
rae HO democracy and technological advances — carry traces of the 
Puta original vision. But Jordan shows too how mixed a legacy we face, 
j from ignorance about science, a bulging population and “juggernaut 
y technology” to degraded ecosystems. Particularly by upholding 


ethics, he argues, we can collectively turn the tide. 


Underwater Eden: Saving the Last Coral Wilderness on Earth 
Gregory S. Stone and David Obura UNIV. CHICAGO PRESS 184 pp. $40 
(2012) 

Ocean warming and acidification are bad news for corals, and more 
than one-quarter of fish species. So when Gregory Stone dived 
around the remote Pacific Phoenix Islands in 2002, he was stunned 
to see a ‘lost world’ of untouched coral beds. Here Stone, chief 
ocean scientist of Conservation International, coral researcher David 
Obura and contributors lay out what happened next: the hard-won 
creation of the largest World Heritage Site ever sanctioned by the 
United Nations Educational, Scientific and Cultural Organization. 


This Explains Everything: Deep, Beautiful, and Elegant Theories of 
How the World Works 

John Brockman HARPER PERENNIAL 432 pp. $15.99 (2013) 

Agent to the stars of science, John Brockman presents mind-bites 
from his stable of research heavyweights asked to name their 
“favourite deep, elegant, or beautiful explanation”. Try theoretical 
physicist Freeman Dyson speculating on the putative coexistence 

of quantum and classical world views, or mathematician Samuel 
Arbesman admiring the reaction—diffusion model that dictates a 
leopard’s spots. 
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Moby-Dick author Herman Melville. 


> literary immortality is concerned. 
His allusive style chimed with a new 
century of discovery, and twentieth-cen- 
tury experimentalists of literature such 
as D. H. Lawrence and Virginia Woolf 
reappraised him as a modernist who lived 
before modernism was invented. 

Melville’s masterpiece also resonates 
powerfully with today’s scientific con- 
cerns. Moby-Dick contrasts the glory 
of the whale with the threats posed by 
humanity. Melville even seems to antici- 
pate the effects of a changing environ- 
ment. In the moving chapter “Does The 
Whale’s Magnitude Diminish? — Will He 
Perish?’ Melville wonders about a flooded 
future, but sees the whale as triumphant, 
spouting “his frothed defiance to the 
skies”. Yet by the time his book finally 
came into its own, Melville’s vision had 
turned into a nightmare for the whale. 

In 1961 alone, more whales died — 
nearly 75,000 — than in the entire span 
of Yankee whaling. With faster ships 
and grenade harpoons, new species had 
come within the hunters’ remit: the blue 
and fin whales of the South Atlantic and 
Southern Ocean. And, like Scoresby, the 
“hip-booted cetologists” (as D. Graham 
Burnett describes them in his The Sound- 
ing of the Whale: Science and Cetaceans 
in the Twentieth Century, University of 
Chicago Press, 2012), entered a complicit 
arrangement with the modern whaling 
industry to inform their conclusions on 
whale anatomy, breeding and migration. 
It is telling, perhaps, that no one has writ- 
ten a follow-up to Moby-Dick to celebrate 
that particular adventure. m 


Philip Hoare is the author of Leviathan, 
or, The Whale. His new book, The Sea 
Inside, is due out in June. He is co-curator 
of the Moby-Dick Big Read (www. 
mobydickbigread.com), an online project 
hosted by Plymouth University, UK. 
e-mail: phoare@themovement.demon. 
co.uk 
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INNOVATION 


Motley inventors 


John Browning welcomes a collective portrait of 
creators that puts life stories first. 


nenthusiast’s book about enthusiasts, 
At Tinkerers surveys a motley collec- 

tion of US innovators whose creations 
are changing the world — or so their makers 
hope. The result is a kaleidoscopic view of the 
myriad forms innovation can take. 

Alec Foege’s book is a useful contribution 
to understanding our era, repeatedly trans- 
formed by innovation that has generated 
pages and gigabytes of analysis in the abstract, 
but surprisingly little on the inventors them- 
selves. Foege redresses the balance, covering 
a crew that ranges from Thomas Edison to 
Silicon Valley whizz-kids. He tries to let his 
inventors describe in their own words what 
they thought they were doing, and why. 

Dean Kamen, inventor of the Segway and 
various medical devices, falls closest to the 
mould set by Edison. Based in New Hamp- 
shire, he has used the cash flow from licens- 
ing his automatic syringe, dialysis machine 
and other medical innovations to invent 
solutions to whatever takes his fancy — such 
as a wheelchair that can navigate steps. Saul 
Griffith puts a more modern spin on the 
model. His “do tank’ Squid Labs in Alameda, 
California, spins off companies rather than 
licences — including Howtoons, a website 
featuring educational cartoons, and Makani 
Power, which uses airborne wind turbines to 
harvest the energy of high-altitude winds. 

Meanwhile, Nathan Myhrvold, Microsoft's 
former head of technology, takes new models 
of innovation to a logical extreme. Instead of 
creating his own inventions, his company, 
Intellectual Ventures, buys, finances and 
creates a large patent portfolio, which it then 
licenses to others. He argues that this inspires 
a broad swathe of creativity. Some, in turn, 
argue that it encourages a profusion of pat- 
ents and could merely mire innovation in 
legal uncertainties and lawsuits. 

Foege doesn't provide any solutions, or 
even take a strong stance in that debate. For 
better or worse, The Tinkerers feels like a 
prototype — a bit rough and ready, created 
as much to point towards interesting ques- 
tions as to provide answers. To be truly repre- 
sentative, the choice of profiles should at least 
have included some innovators in biology or 
medical science, and some working in cor- 
porate laboratories. But the hotchpotch does 
capture a range of approaches and motiva- 
tions. Kamen seems to like solving problems. 
Griffith wants to create a greener, smarter 
world. And although he contributed to the 
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underpinnings of 
Microsoft Windows, 
Myrhvold now sees 
his role in promoting 
innovation as largely 
financial and legal. 

Foege’s approach 
contrasts with the 
books on innovation that take the ‘grand 
theory of creativity’ route, knocking the 
sharp edges off individual histories to fit 
them in. Steven Johnson’s Where Good Ideas 
Come From (Riverhead, 2010) fits inspiration 
into seven categories, including hunches and 
happy accidents. Clayton Christensen’s The 
Innovator’s Dilemma (Harvard Business 
School Press, 1997) divvies up the world of 
innovation according to its impact on busi- 
ness strategy. Fascinating though such books 
often are, this abstract point of view was more 
valuable 50 years ago than today. 

Then, innovation was managed. Much 
of it happened in government-funded set- 
ups such as Bell Labs and DARPA (the 
US Defense Advanced Research Projects 
Agency), or at big corporate-funded labs like 
IBM and Dow Chemical. President Dwight 
D. Eisenhower, in his 1961 farewell address, 
worried that “a government contract becomes 
virtually a substitute for intellectual curiosity”. 
And so powerful were the managers of that 
funding that he followed his warning about 
the military-industrial complex with another 
about the “scientific-technological elite”. 

Nearly 30 years ago, Steven Levy’s book 
Hackers (Anchor/Doubleday, 1984) inspired 
a generation with the idea that ideas born 
of subversion are often more powerful than 
those blessed by management. Today, innova- 
tion is popping up all over the place: garages; 
‘hack-spaces’; bedrooms. Nobody is manag- 
ing the process. So everybody's story matters. 

Few of those stories have yet been told, 
making those that have the exceptions. One 
example, Walter Isaacson’s biography of 
Steve Jobs, sparked a global debate about 
how to manage innovation grounded in 
life rather than theory. But the exceptions 
should become the rule. Innovators’ his- 
tories need to be written. For all its flaws, 
Foege’s book is a step in the right direction. m 
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Time crystals 


Physicists have come up with the mind-boggling concept of a time crystal. This intriguing proposal, which is based on the 
notion of broken time-translation symmetry, might open up a whole new field of research. 


PIERS COLEMAN 


long pondered the perpetual periodici- 

ties in time that make up our lives — 
from a heartbeat to the motions of the planets. 
Isaac Newton and Leonardo da Vinci both 
considered perpetual motion, concluding 
that it is the stuff of alchemy. Curiously, we 
now know that perpetual motion is possible, 
but perpetual oscillation, that’s another thing. 
There is a new debate afoot involving the con- 
nection between time and “broken symmetry. 
A snowflake epitomizes this concept, because 
as it crystallizes, it lowers its energy by break- 
ing the homogeneity and isotropy of water. 
Such ‘spontaneously broken symmetry’ per- 
meates the world we live in: for example, the 
orientation of the liquid crystals in a television 
screen and the tiny range of forces between 
subatomic particles both rely on broken sym- 
metry. These cases of broken symmetry leave 
the homogeneity of time unchallenged, but 
the world we live in is also full of clocks — of 
periodicities and oscillations that break time- 
translation symmetry. 

A series of three papers’* published in 
Physical Review Letters proposes and explores 
the evocative new concept of a time crystal. 
The authors of these papers ask: can temporal 
periodicities spontaneously develop in a fash- 
ion analogous to the spatial order of crystals? 
Naturally oscillating physical systems abound: 
for example, the sound of an organ pipe and the 
workings of clocks in all their variety. But these 
systems require an input of energy to drive the 
oscillation. The idea behind a crystal in the 
time domain is that it requires no energy input. 
Like its spatial cousin, it will spontaneously 
form without a driving force. 

It turns out that the concept ofa time crystal 
requires grappling with various subtle issues in 
quantum mechanics. In quantum mechanics, 
states of definite energy are called stationary 
states because the rate of change of any observ- 
able quantity associated with the state is zero. 
Paradoxically, this does not preclude states of 
perpetual motion: a famous example is super- 
conductivity, whereby certain metals, when 
cooled, support perpetual electric currents. It 
is these currents that drive the magnetic fields 
in a magnetic resonance imaging scanner. Ifa 
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Figure 1 | Periodicities in space and time. a, The 
wavefunction, ¥, of a crystal is spatially 
homogeneous. But when observed at one corner, 

it is revealed to have infinite-range spatial 
correlations associated with broken symmetry, 
exhibiting a type of order called off-diagonal long- 
range order’. b, A tiny magnetic field, B, introduced 
into a metal ring induces a persistent ‘supercurrent’ 
in the ring, but time-translation symmetry 

is unbroken. c, Inducing a weak attraction 

between the particles that carry the supercurrent 
causes them to bunch together, leading to the 
development of a ‘solitor’ in the particle density. 
Wilczek proposes’ that, if this soliton is carried 
around with the superflow in the presence of a 
small magnetic field, time-translation symmetry is 
broken and the rotating soliton will form a periodic 
structure in time — a time crystal. 
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tiny magnetic field is passed through a metal 
ring, which is then cooled into the super- 
conducting state, the ring spontaneously 
develops a persistent current. But whereas the 
spontaneously developed motion is perpetual, 
the current is homogeneous and constant, so 
time-translation symmetry is still unbroken. 

In physics, the idea of broken symmetry 
is associated with the concept of an ‘order 
parameter’: in a crystal, this is the periodic 
modulation in its density. Broken symme- 
try manifests itself through the development 
of infinite-range correlations of the order 
parameter. Thus, in a crystal, the density 
modulates coherently over infinite distances. 
In quantum physics, broken symmetry is 
not immediately evident from a system’s 
wavefunction, because the vertices of the 
crystal are equally likely to occupy any point 
in space. For example, you cannot see the 
broken symmetry in a crystal until you meas- 
ure the position of one of its atomic constitu- 
ents. The corresponding crystal wavefunction 
is completely homogeneous in space, because 
it is a superposition of broken-symmetry 
states; once you observe a small piece of the 
crystal, the modulated density becomes visible. 
The development of these entangled, infinite- 
range spatial correlations in an otherwise 
spatially homogeneous wavefunction is called 
off-diagonal long-range order’. 

In the first of the three papers, Wilczek 
introduces’ the concept of a time crystal, argu- 
ing that such a state would exhibit off-diagonal 
long-range order in the time domain. Wilczek 
proposes a thought experiment involving 
the supercurrent spontaneously induced in 
a superconducting ring threaded by a tiny 
magnetic field. Suppose, he argues, one were 
to induce a weak attraction between the par- 
ticles in the superconductor, causing them to 
bunch together. Such a state is called a soliton. 
Provided that the soliton forms without elimi- 
nating the superconductivity, then in a small 
magnetic field a supercurrent will sponta- 
neously emerge, carrying the soliton periodi- 
cally around the ring to forma simple example 
ofa time crystal (Fig. 1). 

Although crystals owe their microscopic 
physics to quantum mechanics, their macro- 
scopic behaviour is governed by classi- 
cal mechanics: time crystals are surely no 


different. In the second paper, Shapere and 
Wilczek show’ that, to understand time 
crystals, one needs an advanced version of 
classical mechanics based on Lagrange’ ‘least 
actio principle, in which the Newtonian 
motion of particles is understood as the path 
that minimizes a combination of kinetic and 
potential energy called the action. They find 
that time crystals are classically feasible, but dis- 
cover that, curiously, once a time crystal devel- 
ops, the concept of energy becomes ambiguous 
and the action principle takes its place. 

But if time crystals exist, how do we detect 
them? In the last of the three papers, Li and 
collaborators’ suggest a method to engineer a 
time crystal using an ion trap. When ions are 
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trapped in a circular ring, their repulsive inter- 
actions cause them to arrange themselves into 
a spatial crystal. Following Wilczek’s proposal’, 
Li et al. suggest that, when cooled ina small 
magnetic field, the crystal will spontaneously 
rotate. This idea is closely related to the idea 
of ‘supersolids, itself a subject of great recent 
interest and controversy”. 

Perhaps the most important aspect of 
the time-crystal notion is that it envisages 
broken time-translation symmetry as an 
equilibrium phenomenon, rather than a 
non-equilibrium response to a driving force. 
This is an exciting yet controversial® devel- 
opment, which if right will open up a whole 
new field of research. With the discovery of 
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superconductors, the debate on perpetual 
motion was brought to an end. With time 
crystals, a new debate has just begun. m= 
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communities to give up part of their fishing 
area for the establishment ofa marine reserve. 

The enthusiasm for marine reserves in the 
conservation community has led to strong 
advocacy for their use as a fishery- management 


Most fish living in marine reserves are older, bigger and more fecund than those 
outside their borders, but they are also slower to flee a threat. The potential for 
‘spillover’ of such fish into fisheries may boost support for reserves. 


PETER F. SALE 


arine reserves are designed to protect 
M the aquatic life within their margins. 

It seems probable that, by providing 
aplace for fish to grow and reproduce, they will 
also benefit adjacent fisheries. Writing in Ecol- 
ogy Letters, Januchowski-Hartley et al.' adda 
new twist to this concept, by demonstrating 
that fish that wander over a reserve bound- 
ary into waters in which fishing occurs are 
more easily caught than those that have lived 
continuously with the threat of fishing. 

Fish living in a reserve in which fishing is 
prohibited are expected to live longer, grow 
larger and have more offspring than those not 
so protected”. Some of these offspring will dis- 
perse naturally beyond the boundaries as eggs 
or larvae, in a process called recruitment sub- 
sidy; the remaining offspring generate greater 
fish numbers and biomass inside the reserve. 
This should, in turn, lead to ‘spillover’ — the 
continuous net movement of juvenile and 
adult individuals beyond reserve boundaries. 
The effects of recruitment subsidy are thought 
to be large and widespread because the lar- 
vae of most fish species are pelagic (they live 
within the water column, rather than near the 
bottom) and are therefore readily dispersed. By 
contrast, the spillover effect is expected to be 
restricted to sites close to the reserve boundary. 

However, this eminently logical theory 
depends on assumptions that are seldom stated 
explicitly’. For this model to hold, the reserve 
must be large enough (or the fish sedentary 
enough) for individuals to experience reduced 
fishing pressure — a fish that wanders widely, 


swimming in and out of a reserve, will get little 
benefit. The reserve must also be well managed, 
so that fishing pressure within its boundaries 
really is lower than that outside. And the fish 
must produce offspring that are dispersive early 
in life. If these requirements are met, marine 
reserves should enhance fishery yields through 
both recruitment subsidy and spillover, and 
this expectation has become a commonly 
used argument in attempts to convince fishing 


tool — an advocacy effort that has jumped well 
ahead of the evidence that real augmentation of 
fishing actually results. Although the extent of 
this overreach has been noted several times””, 
data showing that some reserves seem not to 
enhance fisheries’ have been glossed over, and 
less-positive reports, such as one showing that 
the increase in marine protected areas world- 
wide is having no effect on the global loss of 
biodiversity’, can have somewhat difficult 
paths to publication. Nevertheless, despite the 
hype, there is now abundant evidence that, for 
reasonably sedentary species in many environ- 
ments, marine reserves will contain the more 
abundant and older fish expected by theory™*. 
Concurrently, there is growing evidence of 


Figure 1 | Beyond the boundaries. Januchowski-Hartley et al.' compared the behaviour of fish species 
that are targets for fishing, including surgeonfish (such as Naso vlamingii, shown) and parrotfish, with 
non-targeted butterflyfish. They show that the flight initiation distance — the distance between a fish 
and an approaching snorkeller at which the fish flees — is smaller for targeted species in marine reserves 
than for those in fisheries. Moreover, this distance increases with increasing distance from the reserve 
boundary. No difference in flight behaviour was observed in the non-targeted species. 
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different. In the second paper, Shapere and 
Wilczek show’ that, to understand time 
crystals, one needs an advanced version of 
classical mechanics based on Lagrange’ ‘least 
actio principle, in which the Newtonian 
motion of particles is understood as the path 
that minimizes a combination of kinetic and 
potential energy called the action. They find 
that time crystals are classically feasible, but dis- 
cover that, curiously, once a time crystal devel- 
ops, the concept of energy becomes ambiguous 
and the action principle takes its place. 

But if time crystals exist, how do we detect 
them? In the last of the three papers, Li and 
collaborators’ suggest a method to engineer a 
time crystal using an ion trap. When ions are 
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Li et al. suggest that, when cooled ina small 
magnetic field, the crystal will spontaneously 
rotate. This idea is closely related to the idea 
of ‘supersolids, itself a subject of great recent 
interest and controversy”. 

Perhaps the most important aspect of 
the time-crystal notion is that it envisages 
broken time-translation symmetry as an 
equilibrium phenomenon, rather than a 
non-equilibrium response to a driving force. 
This is an exciting yet controversial® devel- 
opment, which if right will open up a whole 
new field of research. With the discovery of 
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superconductors, the debate on perpetual 
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crystals, a new debate has just begun. m= 
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or larvae, in a process called recruitment sub- 
sidy; the remaining offspring generate greater 
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continuous net movement of juvenile and 
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to be large and widespread because the lar- 
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bottom) and are therefore readily dispersed. By 
contrast, the spillover effect is expected to be 
restricted to sites close to the reserve boundary. 

However, this eminently logical theory 
depends on assumptions that are seldom stated 
explicitly’. For this model to hold, the reserve 
must be large enough (or the fish sedentary 
enough) for individuals to experience reduced 
fishing pressure — a fish that wanders widely, 


swimming in and out of a reserve, will get little 
benefit. The reserve must also be well managed, 
so that fishing pressure within its boundaries 
really is lower than that outside. And the fish 
must produce offspring that are dispersive early 
in life. If these requirements are met, marine 
reserves should enhance fishery yields through 
both recruitment subsidy and spillover, and 
this expectation has become a commonly 
used argument in attempts to convince fishing 


tool — an advocacy effort that has jumped well 
ahead of the evidence that real augmentation of 
fishing actually results. Although the extent of 
this overreach has been noted several times””, 
data showing that some reserves seem not to 
enhance fisheries’ have been glossed over, and 
less-positive reports, such as one showing that 
the increase in marine protected areas world- 
wide is having no effect on the global loss of 
biodiversity’, can have somewhat difficult 
paths to publication. Nevertheless, despite the 
hype, there is now abundant evidence that, for 
reasonably sedentary species in many environ- 
ments, marine reserves will contain the more 
abundant and older fish expected by theory™*. 
Concurrently, there is growing evidence of 


Figure 1 | Beyond the boundaries. Januchowski-Hartley et al.' compared the behaviour of fish species 
that are targets for fishing, including surgeonfish (such as Naso vlamingii, shown) and parrotfish, with 
non-targeted butterflyfish. They show that the flight initiation distance — the distance between a fish 
and an approaching snorkeller at which the fish flees — is smaller for targeted species in marine reserves 
than for those in fisheries. Moreover, this distance increases with increasing distance from the reserve 
boundary. No difference in flight behaviour was observed in the non-targeted species. 
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spillover, particularly in coral-reef systems”®, 
and some evidence for recruitment subsidy, in 
particular from a recent study of larval disper- 
sal from marine reserves within the Great Bar- 
rier Reef Marine Park in Australia’. This study 
found that 55% (for stripey snapper; Lutjanus 
carponotatus) and 83% (for coral trout; Plectro- 
pomus maculatus) of all juveniles produced in 
reserves dispersed to subsidize recruitment to 
surrounding fishing grounds. 

Now, Januchowski-Hartley and colleagues 
take the evidence for spillover a step further. 
They report that, in three Philippine locations, 
fish that live in the safety of a reserve are less 
vigilant than fish living outside the protected 
area, and that they bring their less-vigilant 
behaviour with them when they wander across 
a reserve boundary. 

The researchers looked at the behaviour 
of three families of reef fish: butterflyfish 
(Chaetodontidae), which are not a target 
for fishing, and surgeonfish (Acanthuridae) 
and parrotfish (Scaridae), which are fishery 
targets (Fig. 1). The study design involved 
spotting individual fish engaged in feeding 
or swimming behaviour (not in social activi- 
ties). A snorkeller (Januchowski-Hartley) 
would then descend to about 8-10 metres 
away from the fish, and swim towards it at 
constant slow speed. When the fish fled or 
sought shelter among the coral, the snor- 
keller recorded the linear distance between 
himself and the fish just before it fled — the 
flight initiation distance (FID). The authors 
also recorded the estimated total length of 
the fish. They classified each encounter as 


Topology matters 


The effects of topology have been 
investigated in systems as diverse as 
molecules and the cosmos, but rarely at the 
micrometre scale. In this issue, Senyuk et al. 
explore how topology affects the alignment 
of micro-scale particles suspended in a 
‘nematic’ host matrix — a liquid crystal in 
which the molecules are aligned but do not 
form well-defined planes (B. Senyuk et al. 
Nature 493, 200-205; 2013). 

The particles in naturally occurring colloids 
— systems of tiny particles of one material 
suspended in a different material — are often 
spheres or faceted crystals. Such particles 
are said to have a topology of genus (g) zero. 
To explore the effects of different topologies 
on colloid particles, Senyuk et al. synthesized 
particles constructed from rings 5-10 um in 
diameter. These ranged from simple hoops 
(g=1) to ‘Olympic rings’ (g=5; pictured). 

The authors found that when the particles 
were dispersed in a nematic liquid crystal, 


within one of eight 50-metre-wide zones that 
spanned from 200 m inside to 200 m outside 
the reserve boundary. Finally, they repeated 
the same procedure around control ‘bounda- 
ries within the fishing zone and away from 
marine reserves. 

The main results are straightforward: the 
FID increased for each of the two fished fami- 
lies in all three reserves as the location of the 
encounters moved from inside to outside the 
boundary. No such trend was seen across con- 
trol boundaries or for the non-fished butterfly- 
fish. Because fish are easier to spear if they have 
a shorter FID, these results mean that fish just 
outside the boundary ofa marine reserve will 
be easier to catch (at least by spearfishing) than 
those farther from the boundary. 

The obvious explanation is that fish learn 
to adjust their FID according to the level of 
risk, that FIDs are shorter within reserves 
(where risk is low) and that fish that spill over 
into the fished area take time to increase their 
FIDs. These findings imply that the pres- 
ence of a marine reserve is even more ben- 
eficial to a fishery than previously expected 
— not only do fish spill over, but also those 
that do are naive and more easily caught. 
Marine-reserve advocates will be delighted by 
this suggestion. 

Are these results surprising, and are they 
important? Yes, and yes. I suspect that they 
will surprise those fishery scientists who use 
catch statistics as their primary data, and who 
tend to think of fish as hanging around wait- 
ing to be caught, although they will not sur- 
prise biologists who watch fish before the fish 


they typically aligned with their ring planes 
perpendicular to the direction of alignment of 
the liquid crystal (the ‘director’). Each particle 
of genus g generated at least g—1 defects in 
the liquid-crystal matrix, in agreement with 
topological theorems. These were either point 
defects in the rings’ holes, or loop defects 
that could run around the inside or outside of 
the rings. What’s more, the orientation of the 
liquid-crystal molecules around each particle 
was dictated by the particle’s topology. 
Senyuk and co-workers went on to show 
that the particles could be aligned parallel 
to the director by melting and rapidly 
cooling the surrounding liquid crystals. In 
this case, defects appeared in the liquid 
crystal both within and next to the particles. 
And by applying an electric field at different 
rates, the authors could switch either the 
orientation of the colloidal particles or 
the configuration of the liquid-crystal 
molecules around the particles; the 
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are caught. Fish are behaving beings, and are 
highly capable of adapting their behaviour to 
particular circumstances. Their behaviour as 
larvae, for example, gives them amazing con- 
trol over their dispersal’, and, as juveniles and 
adults, behaviour adds much of the complexity 
that builds up fish social structures and con- 
tributes to the challenges of managing some 
fisheries. Januchowski-Hartley and colleagues’ 
results are a direct demonstration of an unex- 
pected consequence of adaptive fish behaviour, 
and they will be relevant to anyone seeking to 
quantify the effects of a marine reserve on 
fishery yields. m 
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resulting states were stable when the electric 
field was removed. 

Finally, the researchers found that the 
particles diffused more easily along the 
nematic director than in other directions, and 
that the rate of diffusion decreased 
with increasing genus number. Taken 
together, these findings open up fresh 
opportunities for colloidal organization and 
self-assembly, as well as potential applications 
— in electro-optic or photonic devices, for 
example. Rosamund Daw 
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are caught. Fish are behaving beings, and are 
highly capable of adapting their behaviour to 
particular circumstances. Their behaviour as 
larvae, for example, gives them amazing con- 
trol over their dispersal’, and, as juveniles and 
adults, behaviour adds much of the complexity 
that builds up fish social structures and con- 
tributes to the challenges of managing some 
fisheries. Januchowski-Hartley and colleagues’ 
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pected consequence of adaptive fish behaviour, 
and they will be relevant to anyone seeking to 
quantify the effects of a marine reserve on 
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Could one of these conditions be general — or 
‘ancestral’ — for vertebrates, and the other have 
subsequently derived from it? Oisi et al. suggest 
that the cyclostome condition (a median NHP 
is likely to be the earlier structure, because the 
monorhinal condition is found in the extinct 
jawless vertebrates commonly known as 
ostracoderms, which are between 480 million 
and 370 million years old and are regarded as 
more closely related to gnathostomes than to 
cyclostomes (Fig. 1). 

Placodes are embryonic structures that are 
not preserved in ostracoderm fossils, but the 
skeletons of these animals provide informa- 
tion about their differing cranial anatomy”. 
Ostracoderms display diverse arrangements 
of the cranial spaces that housed the olfac- 
tory organs and the adenohypophysis, but 
in all of them these spaces open towards a 
median nostril, as in cyclostomes. For exam- 
ple, anaspids and osteostracans have a median 
dorsal nasohypophyseal complex similar to 
that of lampreys, which suggests some kind of 
cyclostome-like NHP, although other features 
show that osteostracans are closer to gnatho- 
stomes than to any other ostracoderm. By con- 
trast, heterostracans and galeaspids had paired 
olfactory organs, still closely associated with 
the adenohypophysis in the latter, but open- 
ing anteriorly by a large median nostril and 
suggesting a broader NHP (Fig. 1). 

This anatomical diversity is puzzling, but 
Oisi and colleagues’ findings on hagfish 
development help to clarify the picture. The 
authors propose that the presence of the NHP 
in both hagfish and lamprey embryos sup- 
ports the idea ofa general, ‘pan-cyclostome’ 
pattern for craniofacial development, and 
that the anatomical characteristics that make 
present-day hagfish seem more primitive than 
lampreys are the result of a subsequent loss of 
features in hagfish during evolution. Such a 
pattern, combined with the fact that no NHP 
is seen during the development of any living 
gnathostomes, and the apparent progression 
from the monorhinal to diplorhinal condi- 
tion in ostracoderms, suggests that a crucial 
innovation in the development of the verte- 
brate snout was the loss of a common naso- 
hypophyseal duct and the rise of separate, 
paired nostrils. = 
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Phased array on 


a fingertip 


An array of more than 4,000 optical antennas working in unison has been 
demonstrated on a millimetre-scale silicon chip. The result highlights the 
remarkable capabilities of optical integration in silicon. SEE LETTER P.195 


THOMAS F. KRAUSS 


Ferdinand Braun remarked’ that: “It had 

always seemed most desirable to me to 
transmit the waves, in the main, in one direc- 
tion only”. He was referring to radio waves, 
which had hitherto been known to radiate 
only in a doughnut shape away from the emit- 
ter, as described by Heinrich Hertz two dec- 
ades earlier. The invention for which Braun 
was recognized was the phased array — the 
idea of connecting up multiple antennas fed 
from a common source and using the rela- 
tionship between the antennas’ phase (where 
a wave's peaks and troughs lie) to enhance the 
emission of radio waves in a given direction. 
His work was instrumental in the develop- 
ment of radio transmitters, and led to other 
inventions such as the television antenna and 
radar. The phased-array concept is now also 
extensively used in astronomy, where the 
information from multiple telescopes is col- 
lected in phase to extend the resolution of the 
system. On page 195 of this issue, Sun et al.” 
introduce an integrated optical phased array 
that combines the equivalent of more than 
4,000 telescopes onto a single chip smaller 
than a fingernail. 

The creation of phased arrays in the opti- 
cal domain has been an active area of research 
for some time, as it is motivated by the need 
to improve applications such as optical beam 
steering and imaging. To achieve the highest 
performance, the challenges are to integrate a 
large number of antennas onto a single chip, to 
place them close together and to fully control 
the phase on each element. The short opti- 
cal wavelength makes it easy, in principle, to 
address the first challenge and to build large 
arrays on a small footprint in an integrated for- 
mat. The other two challenges are much more 
difficult to meet, however, because the opti- 
cal couplers that feed a controlled amount of 
power into each antenna and the phase shifters 
needed to control the phase on each element 
tend to be tens or hundreds of wavelengths 
long, which requires a large distance between 
each antenna. Therefore, the latest realizations 
of integrated optical phased arrays have been 
relatively large and have featured a maximum 
of only 4x4 antennas”. 


1E his Nobel lecture in 1909, physicist 
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The architecture now introduced by Sun 
et al.” addresses all of these requirements and 
is scalable. By carefully designing the optical 
coupler, phase shifter and gratings that act as 
radiating antennas, the team has managed 
to produce a phased array of 64 x 64 antenna 
units, or ‘pixels’ on a footprint of a little more 
than 0.576 x 0.576 millimetres, with each pixel 
covering 9 x 9 micrometres — equivalent to a 
pixel side length of six optical wavelengths. 
Most impressive are the accuracy and control 
that the researchers were able to exercise over 
the array’s optical functions, especially those 
of the optical couplers. 

The device's accuracy and performance are 
highlighted by the well-balanced power output 
achieved across the array. Equally, the fact that 
a phase shift of 7 radians can be attained on 
a pixel size a few wavelengths across (in con- 
trast to the tens or hundreds of wavelengths 
previously required), and with relatively low 
power and crosstalk, points to the remarkable 
capabilities of nanophotonic concepts that 
are now available. Fabrication of the array, as 
a photonic circuit, using only silicon micro- 
electronics processing techniques — a concept 
pioneered’ by the IMEC microelectronics cen- 
tre in Leuven, Belgium — is also a testimony 
to the high performance that can be achieved 
by using silicon as a photonic material. The 
successful demonstration of the authors’ 
array shows that the large-scale integration 
of photonic circuitry in silicon has well and 
truly arrived. 

So where do we go from here? One inter- 
esting idea is that of imaging through light- 
scattering media’, such as roughened glass or 
biological tissue. Such imaging uses the adap- 
tive-optics principle of adjusting the phase of 
an optical wave locally so as to compensate 
for the distortion caused by the medium. 
The authors’ optical phased array now makes 
available the phase control and large pixel 
number that this application requires. To 
achieve imaging through time-varying light 
scatterers, such as turbulent liquids, a fast 
time-response of the system is also essential; 
the small and efficient tuning of each antenna 
demonstrated by Sun et al. can also meet this 
requirement. Another application is in sensor 
networks, in which communication between 
randomly distributed sensors is based on 


optical beams. Such networks require accurate 
control over the direction of each individual 
beam, which can now be achieved with the 
authors’ array. 

Other applications in imaging, such as 
projection television, involve visible optical 
wavelengths that are shorter than the near- 
infrared wavelengths used by the authors. 
Operation at these wavelengths would require 
further miniaturization of the array’s pixel 
size and waveguides that are not made of sili- 
con, because silicon absorbs light in the vis- 
ible range. Also, the multiple images seen in 
the device’s radiation pattern (see Fig. 2 of 
the paper’) are due to the fact that the pixels 
are multiple wavelengths apart. To reduce the 
number of images, ideally to a single one, it will 


STRUCTURAL BIOLOGY 


be necessary to make the pixels even smaller 
and to move them closer together. This might 
be achievable by stacking the optical coupler, 
phase shifter and grating antenna on top of 
one another, thereby integrating the array in 
three dimensions. Clearly, further miniaturi- 
zation and three-dimensional integration are 
exciting challenges that can now be considered 
seriously. 

Most remarkably, even a century ago Braun 
had already thought ahead, and considered 
whether the antenna effects he described 
might also be observable in the optical domain. 
He wondered! whether the concepts he had 
studied with radio waves could “also be linked 
up with optical phenomena, though this can 
hardly be experimentally verified in this field”. 


Insulin meets 


its receptor 


The hormone insulin has a central role in human physiology, yet the answer toa 
fundamental biochemical question — how it binds to its cell-surface receptor — 
has remained elusive, until now. SEE LETTER P.241 


STEVAN R. HUBBARD 


of physiology and biochemistry’. First 

isolated in 1921, it was soon used to save 
the lives of individuals with diabetes. This 
medical breakthrough and subsequent bio- 
chemical research on insulin have garnered 
three Nobel prizes. The three-dimensional 
structure of insulin was determined by X-ray 
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Figure 1 | Binding of insulin to its receptor. a, Diagram of the dimeric 
insulin receptor. The extracellular region comprises domains L1, C, L2, F1, 
F2 and F3. One half-receptor (at the back) is coloured green, with its domain 
labels marked by an asterisk, and the other half-receptor is in blue; L1 and L1* 
are coloured in darker shades. The carboxy-terminal region of the a-chain 
(aCT) extends from the first half of F2 and folds back onto L1 from the 
opposite half-receptor. b, c, Crystal structure of the extracellular region 

of the apo insulin receptor’®, shown in surface representation (b) and rotated 


crystallography in 1969. Yet, despite intense 
interest, an atomic view of insulin bound to its 
receptor has been lacking — a situation rec- 
tified by the report of Menting et al.’ in this 
issue. 

Over the years, researchers have used pro- 
tein crystallography to determine the struc- 
ture of numerous hormones, growth factors 
and cytokines bound to their receptors**. In 
general, structures of these ligand-receptor 


apo receptor (as inc). 
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Now, Sun et al. have provided us with just such 
an experimental demonstration. Braun would 
surely have been pleased indeed. m 
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systems were tractable because the ligand-bind- 
ing domains of the receptors were of moderate 
size and many could be produced in bacteria. 
Moreover, the binding modes of these ligands 
to their receptors are relatively simple, involv- 
ing one or two receptor subdomains that are 
contiguous in amino-acid sequence. However, 
the production of insulin-receptor protein for 
structural studies is fraught with challenges. 
The insulin receptor belongs to the receptor 
tyrosine-kinase family of cell-surface recep- 
tors’. These are single-pass transmembrane 
proteins with an extracellular region that binds 
a protein ligand (typically a growth factor) and 
a cytoplasmic region that contains a tyrosine- 
kinase domain, which phosphorylates specific 
tyrosine amino-acid residues on other signal- 
ling proteins, as well as on the receptor itself. 
On ligand binding, many receptor tyrosine 
kinases form a dimer, which facilitates tyro- 
sine phosphorylation of one cytoplasmic 
domain by the other, thereby activating the 
receptor®. By contrast, the insulin receptor 
is a preformed, disulphide-bridged dimer, 


Insulin 


by 90° (c). The colouring is consistent with that in a. The red circle 

points to one of the two equivalent insulin-binding sites. d, Menting et al.’ 
present the structure of insulin bound to L1 and aCT. The viewing angle of L1 
is approximately the same as in c. Side chains of interacting residues in insulin, 
L1 and aCT are shown in stick representation. Residues in the A and B chains 
of insulin are coloured pink and purple, respectively. e, As in d, but with 

the other half-receptor superimposed from the structure of the 
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optical beams. Such networks require accurate 
control over the direction of each individual 
beam, which can now be achieved with the 
authors’ array. 

Other applications in imaging, such as 
projection television, involve visible optical 
wavelengths that are shorter than the near- 
infrared wavelengths used by the authors. 
Operation at these wavelengths would require 
further miniaturization of the array’s pixel 
size and waveguides that are not made of sili- 
con, because silicon absorbs light in the vis- 
ible range. Also, the multiple images seen in 
the device’s radiation pattern (see Fig. 2 of 
the paper’) are due to the fact that the pixels 
are multiple wavelengths apart. To reduce the 
number of images, ideally to a single one, it will 
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be necessary to make the pixels even smaller 
and to move them closer together. This might 
be achievable by stacking the optical coupler, 
phase shifter and grating antenna on top of 
one another, thereby integrating the array in 
three dimensions. Clearly, further miniaturi- 
zation and three-dimensional integration are 
exciting challenges that can now be considered 
seriously. 

Most remarkably, even a century ago Braun 
had already thought ahead, and considered 
whether the antenna effects he described 
might also be observable in the optical domain. 
He wondered! whether the concepts he had 
studied with radio waves could “also be linked 
up with optical phenomena, though this can 
hardly be experimentally verified in this field”. 


Insulin meets 


its receptor 


The hormone insulin has a central role in human physiology, yet the answer toa 
fundamental biochemical question — how it binds to its cell-surface receptor — 
has remained elusive, until now. SEE LETTER P.241 


STEVAN R. HUBBARD 


of physiology and biochemistry’. First 

isolated in 1921, it was soon used to save 
the lives of individuals with diabetes. This 
medical breakthrough and subsequent bio- 
chemical research on insulin have garnered 
three Nobel prizes. The three-dimensional 
structure of insulin was determined by X-ray 
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Figure 1 | Binding of insulin to its receptor. a, Diagram of the dimeric 
insulin receptor. The extracellular region comprises domains L1, C, L2, F1, 
F2 and F3. One half-receptor (at the back) is coloured green, with its domain 
labels marked by an asterisk, and the other half-receptor is in blue; L1 and L1* 
are coloured in darker shades. The carboxy-terminal region of the a-chain 
(aCT) extends from the first half of F2 and folds back onto L1 from the 
opposite half-receptor. b, c, Crystal structure of the extracellular region 

of the apo insulin receptor’®, shown in surface representation (b) and rotated 


crystallography in 1969. Yet, despite intense 
interest, an atomic view of insulin bound to its 
receptor has been lacking — a situation rec- 
tified by the report of Menting et al.’ in this 
issue. 

Over the years, researchers have used pro- 
tein crystallography to determine the struc- 
ture of numerous hormones, growth factors 
and cytokines bound to their receptors**. In 
general, structures of these ligand-receptor 


apo receptor (as inc). 
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Now, Sun et al. have provided us with just such 
an experimental demonstration. Braun would 
surely have been pleased indeed. m 
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systems were tractable because the ligand-bind- 
ing domains of the receptors were of moderate 
size and many could be produced in bacteria. 
Moreover, the binding modes of these ligands 
to their receptors are relatively simple, involv- 
ing one or two receptor subdomains that are 
contiguous in amino-acid sequence. However, 
the production of insulin-receptor protein for 
structural studies is fraught with challenges. 
The insulin receptor belongs to the receptor 
tyrosine-kinase family of cell-surface recep- 
tors’. These are single-pass transmembrane 
proteins with an extracellular region that binds 
a protein ligand (typically a growth factor) and 
a cytoplasmic region that contains a tyrosine- 
kinase domain, which phosphorylates specific 
tyrosine amino-acid residues on other signal- 
ling proteins, as well as on the receptor itself. 
On ligand binding, many receptor tyrosine 
kinases form a dimer, which facilitates tyro- 
sine phosphorylation of one cytoplasmic 
domain by the other, thereby activating the 
receptor®. By contrast, the insulin receptor 
is a preformed, disulphide-bridged dimer, 
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by 90° (c). The colouring is consistent with that in a. The red circle 

points to one of the two equivalent insulin-binding sites. d, Menting et al.’ 
present the structure of insulin bound to L1 and aCT. The viewing angle of L1 
is approximately the same as in c. Side chains of interacting residues in insulin, 
L1 and aCT are shown in stick representation. Residues in the A and B chains 
of insulin are coloured pink and purple, respectively. e, As in d, but with 

the other half-receptor superimposed from the structure of the 
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and insulin binding is thought to induce a 
conformational change in the receptor that 
initiates cross-phosphorylation of the two 
cytoplasmic domains. 

The mature insulin receptor consists of two 
copies of two polypeptide chains, a and B. The 
a-chain (723 amino-acid residues) is entirely 
extracellular and heavily glycosylated. The 
B-chain (620 residues) starts on the extra- 
cellular side and spans the membrane (by way 
of an a-helix) to the cytoplasmic side, which 
harbours the tyrosine-kinase domain (Fig. la). 
Each a-chain is linked to a B-chain through a 
disulphide bridge to form an af half-receptor, 
and the two half-receptors are linked by at least 
two (and possibly four) disulphide bridges. 
The extracellular region of each half-receptor 
contains a series of folded domains: L1, C, L2, 
F1, F2 and F3 (Fig. 1a). 

Mature insulin consists of two polypeptide 
chains: an A chain of 21 residues anda B chain 
of 30 residues. It contains one intrachain 
disulphide bridge (in the A chain) and two 
interchain disulphide bridges. 

Previous biochemical studies’” established 
that a single insulin molecule binds, with 
sub-nanomolar affinity, to the dimeric recep- 
tor (1:2 stoichiometry) and activates it. The 
two equivalent insulin-binding sites on 
the receptor are composed of two distinct 
interfaces. The primary interface consists 
of the L1 domain of one half-receptor and 
the carboxy-terminal region of the a-chain 
(aCT) of the second half-receptor. The sec- 
ondary interface comprises the loop regions 
near the junction of F1 and F2 in the second 
half-receptor. 

In 2006, some of the authors of the current 
study determined the atomic structure of the 
dimeric extracellular portion of the insulin 
receptor in the apo conformation (devoid of 
insulin). This landmark structure® revealed an 
inverted V shape, with the two half-receptors 
arranged in an antiparallel fashion (Fig. 1b,c). 
Although an insulin-mimetic peptide was pre- 
sentin the crystallization trials, it could not be 
located in the electron-density map and was 
presumed to be unbound. 

Menting et al. now use some of the same 
strategies to crystallize truncated versions of 
the a-chain, containing only the primary bind- 
ing interface (involving L1 and aCT), in the 
presence of insulin. Because the aCT region is 
a considerable distance in sequence from L1, 
they either added it as a synthetic peptide or 
spliced it onto the C-terminal end of the trun- 
cated a-chain construct. This time, the catch 
was made: insulin was visible in the electron- 
density maps along with the receptor. 

In the four crystal structures the authors 
determined, insulin is bound to the L1 
domain, but just barely (Fig. 1d). Almost all 
of the hydrophobic residues extending from 
the flat 8-sheet surface of L1, which were pre- 
viously mapped’ as insulin-binding ‘hotspots, 
contact the aCT helix rather than insulin, and 


it is the aCT helix that is in intimate contact 
with insulin. 

Two structural transitions occur on insulin 
binding, one in insulin and one in aCT (L1 is 
not appreciably altered). In structures of insu- 
lin alone, residues at the C-terminal end of the 
B chain pack against the rest of the molecule, 
but are displaced on binding to aCT. This 
observation confirms a long-standing predic- 
tion that such a structural transition occurs in 
insulin on binding to its receptor’. What was 
not predicted, however, was the behaviour 
of aCT. In the apo structure, the aCT helix 
is bound to L1, but insulin binding causes a 
repositioning of the helix on the L1 surface such 
that it interacts with insulin, as well as with L1. 

Menting and colleagues’ structures also 
provide insights into the receptor-activation 
mechanism. If the L1 domain from the insulin- 
bound structure is superimposed on L1 from 
the apo structure, it is apparent that a confor- 
mational change must take place in the other 
half-receptor (near F1 and F2, the secondary 
binding interface) to accommodate insulin 
binding to aCT-L1 (Fig. le). Evidently, this 
change underlies the structural mechanism 
whereby insulin binding triggers cross-phos- 
phorylation of the cytoplasmic kinase domains 
of its receptor. The precise nature of this con- 
formational change and howit repositions the 
kinase domains remain unresolved. 


Fast-acting insulin analogues have been 
used to treat patients with diabetes for many 
years, and the three-dimensional structure of 
insulin was instrumental in their design’. The 
structure of insulin bound to its receptor is not 
only the latest milestone in insulin research, 
but should also afford opportunities for the 
design of insulin analogues with enhanced 
receptor binding, as well as favourable 
pharmacokinetics. = 
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Wasted loops quantified 


A method for dissecting the polymeric networks of gels enables the number of 
loops — strands that connect to themselves — within them to be counted. This 
allows network morphologies to be correlated with gel properties. 


ANNA C. BALAZS 


roducts as diverse as jelly, pharma- 
Pp ceuticals and cosmetics often contain 

a common ingredient: a gel formed 
from an extensive network of cross-linked 
polymer chains. Such gels typically encase a 
large amount of fluid. Hydrogels, for example, 
contain as much as 90% water, yet can still be 
as strong and tough as rubber’. The strength 
of these materials comes from the polymers 
that interconnect multiple chains within the 
network, whereas strands that form non-con- 
necting loops weaken the material (Fig. 1). 
Writing in Proceedings of the National Academy 
of Sciences, Zhou et al.’ report an experimental 
method for determining the number of loops 
— something that wasn’t previously possible. 
The ability to quantify the fraction ofa gel that 
consists of such ‘wasted’ loops’ should enable 
researchers to establish synthetic routes that 
minimize the formation of these topologies, 
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and thereby optimize the strength and 
toughness of such materials. 

Zhou et al. cleverly realized that to deter- 
mine the fraction of loops in a polymer gel, 
they had to dissect it, and so they designed a 
material that could be carefully and precisely 
broken apart. Their hydrogel was composed 
of a linear polymer that contained a reactive 
group for forming cross-links at each end, and 
a branched monomer that contained three 
reactive end groups. The linear polymer was 
designed to contain a strategically placed, 
readily cleavable chemical bond. 

The researchers made the gel by combin- 
ing the components in solution, and then 
applied their careful dissection technique to 
break the readily cleavable bonds. Because 
these bonds were precisely placed, only four 
products would form from this degradation 
reaction ifan ideal network without any loops 
had been made; each of those products had its 
own distinctive molecular mass. But if some 
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and insulin binding is thought to induce a 
conformational change in the receptor that 
initiates cross-phosphorylation of the two 
cytoplasmic domains. 

The mature insulin receptor consists of two 
copies of two polypeptide chains, a and B. The 
a-chain (723 amino-acid residues) is entirely 
extracellular and heavily glycosylated. The 
B-chain (620 residues) starts on the extra- 
cellular side and spans the membrane (by way 
of an a-helix) to the cytoplasmic side, which 
harbours the tyrosine-kinase domain (Fig. la). 
Each a-chain is linked to a B-chain through a 
disulphide bridge to form an af half-receptor, 
and the two half-receptors are linked by at least 
two (and possibly four) disulphide bridges. 
The extracellular region of each half-receptor 
contains a series of folded domains: L1, C, L2, 
F1, F2 and F3 (Fig. 1a). 

Mature insulin consists of two polypeptide 
chains: an A chain of 21 residues anda B chain 
of 30 residues. It contains one intrachain 
disulphide bridge (in the A chain) and two 
interchain disulphide bridges. 

Previous biochemical studies’” established 
that a single insulin molecule binds, with 
sub-nanomolar affinity, to the dimeric recep- 
tor (1:2 stoichiometry) and activates it. The 
two equivalent insulin-binding sites on 
the receptor are composed of two distinct 
interfaces. The primary interface consists 
of the L1 domain of one half-receptor and 
the carboxy-terminal region of the a-chain 
(aCT) of the second half-receptor. The sec- 
ondary interface comprises the loop regions 
near the junction of F1 and F2 in the second 
half-receptor. 

In 2006, some of the authors of the current 
study determined the atomic structure of the 
dimeric extracellular portion of the insulin 
receptor in the apo conformation (devoid of 
insulin). This landmark structure® revealed an 
inverted V shape, with the two half-receptors 
arranged in an antiparallel fashion (Fig. 1b,c). 
Although an insulin-mimetic peptide was pre- 
sentin the crystallization trials, it could not be 
located in the electron-density map and was 
presumed to be unbound. 

Menting et al. now use some of the same 
strategies to crystallize truncated versions of 
the a-chain, containing only the primary bind- 
ing interface (involving L1 and aCT), in the 
presence of insulin. Because the aCT region is 
a considerable distance in sequence from L1, 
they either added it as a synthetic peptide or 
spliced it onto the C-terminal end of the trun- 
cated a-chain construct. This time, the catch 
was made: insulin was visible in the electron- 
density maps along with the receptor. 

In the four crystal structures the authors 
determined, insulin is bound to the L1 
domain, but just barely (Fig. 1d). Almost all 
of the hydrophobic residues extending from 
the flat 8-sheet surface of L1, which were pre- 
viously mapped’ as insulin-binding ‘hotspots, 
contact the aCT helix rather than insulin, and 


it is the aCT helix that is in intimate contact 
with insulin. 

Two structural transitions occur on insulin 
binding, one in insulin and one in aCT (L1 is 
not appreciably altered). In structures of insu- 
lin alone, residues at the C-terminal end of the 
B chain pack against the rest of the molecule, 
but are displaced on binding to aCT. This 
observation confirms a long-standing predic- 
tion that such a structural transition occurs in 
insulin on binding to its receptor’. What was 
not predicted, however, was the behaviour 
of aCT. In the apo structure, the aCT helix 
is bound to L1, but insulin binding causes a 
repositioning of the helix on the L1 surface such 
that it interacts with insulin, as well as with L1. 

Menting and colleagues’ structures also 
provide insights into the receptor-activation 
mechanism. If the L1 domain from the insulin- 
bound structure is superimposed on L1 from 
the apo structure, it is apparent that a confor- 
mational change must take place in the other 
half-receptor (near F1 and F2, the secondary 
binding interface) to accommodate insulin 
binding to aCT-L1 (Fig. le). Evidently, this 
change underlies the structural mechanism 
whereby insulin binding triggers cross-phos- 
phorylation of the cytoplasmic kinase domains 
of its receptor. The precise nature of this con- 
formational change and howit repositions the 
kinase domains remain unresolved. 


Fast-acting insulin analogues have been 
used to treat patients with diabetes for many 
years, and the three-dimensional structure of 
insulin was instrumental in their design’. The 
structure of insulin bound to its receptor is not 
only the latest milestone in insulin research, 
but should also afford opportunities for the 
design of insulin analogues with enhanced 
receptor binding, as well as favourable 
pharmacokinetics. = 
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Wasted loops quantified 


A method for dissecting the polymeric networks of gels enables the number of 
loops — strands that connect to themselves — within them to be counted. This 
allows network morphologies to be correlated with gel properties. 


ANNA C. BALAZS 


roducts as diverse as jelly, pharma- 
Pp ceuticals and cosmetics often contain 

a common ingredient: a gel formed 
from an extensive network of cross-linked 
polymer chains. Such gels typically encase a 
large amount of fluid. Hydrogels, for example, 
contain as much as 90% water, yet can still be 
as strong and tough as rubber’. The strength 
of these materials comes from the polymers 
that interconnect multiple chains within the 
network, whereas strands that form non-con- 
necting loops weaken the material (Fig. 1). 
Writing in Proceedings of the National Academy 
of Sciences, Zhou et al.’ report an experimental 
method for determining the number of loops 
— something that wasn’t previously possible. 
The ability to quantify the fraction ofa gel that 
consists of such ‘wasted’ loops’ should enable 
researchers to establish synthetic routes that 
minimize the formation of these topologies, 
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and thereby optimize the strength and 
toughness of such materials. 

Zhou et al. cleverly realized that to deter- 
mine the fraction of loops in a polymer gel, 
they had to dissect it, and so they designed a 
material that could be carefully and precisely 
broken apart. Their hydrogel was composed 
of a linear polymer that contained a reactive 
group for forming cross-links at each end, and 
a branched monomer that contained three 
reactive end groups. The linear polymer was 
designed to contain a strategically placed, 
readily cleavable chemical bond. 

The researchers made the gel by combin- 
ing the components in solution, and then 
applied their careful dissection technique to 
break the readily cleavable bonds. Because 
these bonds were precisely placed, only four 
products would form from this degradation 
reaction ifan ideal network without any loops 
had been made; each of those products had its 
own distinctive molecular mass. But if some 


Figure 1 | Primary loops in cross-linked polymer 
networks. a, In an ideal polymer network, every 
polymer strand forms connections to other 
strands, strengthening the resulting material. 

b, Primary loops form if a strand closes on itself 
without connecting to other chains. The presence 
of such loops reduces the strength of the material. 
Zhou et al.” report a method for quantifying the 
number of loops in the polymer network of a gel. 


loops had formed, different degradation prod- 
ucts (with their own characteristic molecular 
masses) would be obtained. By measuring the 
masses of the actual degradation products and 
comparing them to the predicted masses for 
the ideal network, the authors could there- 
fore determine the number of loops in the 
system. They named this technique ‘network 
disassembly spectrometry’ (NDS). 

A potential limitation of NDS is that one 
must be able to predict the degradation 
products in order to apply the technique. 
Nevertheless, a broad range of polymers and 
chemistries will be amenable to this ana- 
lytical approach. What’s more, the technique 
enables new fundamental studies to be made 
of the dynamics of polymer-network forma- 
tion. For example, by using NDS at different 
stages of their network’s formation, Zhou 
et al. were able to probe the structural evolu- 
tion of the gel. And by varying the reaction 
conditions of network assembly, they could 
control the fraction of loops that formed in 
the system, and thus gain valuable insight into 
the factors that contribute to the formation of 
these ‘defects. 

One application of NDS might be as a tool 
for correlating the microstructure of gels with 
their mechanical properties. This could be use- 
ful in the development of artificial muscles that 
perform useful work’. For example, some gels 
controllably expand and contract in response 
to external stimuli (such as changes in pH, 
illumination or heat)’ or internal chemical 
reactions’, and so can function as actuators 


that drive movement within a system. 

The rhythmic swelling and deswelling of 
gels has been harnessed to push micrometre- 
sized particles and cells along a surface’. 
Heat-responsive gels have also been used to 
drive micrometre-sized posts in and out ofa 
bath of solution’: when the system was hot, 
the gel shrank, pulling the posts out of the 
bath, but when the system cooled down, the 
gel expanded and pushed the posts back in. 
This behaviour formed the basis of a homeo- 
static device that autonomously regulated 
the temperature of the system. For optimal 
performance, such soft actuators need to be 
mechanically robust — which means that they 
should not contain wasted loops. 

On the other hand, because NDS can yield 
correlations between mechanical properties 
and the fraction of loops ina gel, it might help 
to provide design rules for exploiting loops, 
perhaps leading to the development of robust 
materials that nonetheless are highly flexible. 
In particular, there is rapidly growing inter- 
est in reconfigurable materials that can dra- 
matically change shape in response to external 
cues® '°, Examples abound in biology, because 
such adaptive behaviour is vital for survival — 
consider, for example, the ability of octopuses 
and cuttlefish to change their shape, colour and 
texture in order to camouflage themselves in 
the presence of predators. Because gels can be 
made to shrink and swell controllably, they can 
be driven to change shape, and thus are ideal 
synthetic materials for creating systems that 
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accomplish analogous adaptive behaviours. 

Advances in this area could lead to multi- 
functional systems that have one structure 
and function in one environment, but another 
shape and function under different condi- 
tions. However, to undergo significant shape 
changes, the material must be very flexible. It 
might be that the introduction of just the right 
amount of loops within a polymer network 
could lead to robust, but sufficiently flexible, 
shape-changing materials. With findings 
from NDS, perhaps researchers will eventu- 
ally establish effective routes for making gels 
that act like cuttlefish. m 
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How glaciers grow 


A state-of-the-art numerical model shows that the advance of glaciers ina 
cooling climate depends strongly on the pre-existing landscape, and that glacial 
erosion paves the way for greater glacial extent in the future. SEE LETTER P.206 


SIMON H. BROCKLEHURST 


he growth of glaciers reflects the balance 

between the accumulation of snow 

and its loss through melting. Given the 
close association between altitude and tem- 
perature’, the elevation and morphology of the 
valley floor on which a glacier forms are key 
determinants of a glacier’s size and longevity. 
Glacial erosion has carved the spectacular 
alpine landscapes of many mountain ranges, 
and these are characterized by extensive glacial 
valley floors at elevations close to the long-term 
snowline’. These landscapes contrast strongly 
with their precursors, which are generally 
steeper, narrower valleys sculpted by rivers 
(Fig. 1). On page 206 of this issue, Pedersen 
and Egholm’ use a state-of-the-art numeri- 
cal model of glacier dynamics to quantify, for 


the first time, the stark contrast in glacier 
development between modern alpine moun- 
tain ranges and those that came before them. 
They compare the legacy of numerous glacia- 
tions during the Pleistocene epoch (between 
about 2.5 million and 10,000 years ago) with 
the landscapes that would have been present 
at the onset of these glaciations. 

It has long been recognized that glaciers 
and their surroundings share an intimate, 
coupled relationship*”. Shaded aspects and 
large areas at high elevation promote glacier 
growth. In turn, glacial erosion strongly modi- 
fies the underlying landscape, widening valley 
floors and making hill slopes steeper. However, 
given the efficiency of glaciers at reworking 
and removing the evidence left by previous 
glacial cycles, direct insight into the relation- 
ship between glaciers and topography during 
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Figure 1 | Primary loops in cross-linked polymer 
networks. a, In an ideal polymer network, every 
polymer strand forms connections to other 
strands, strengthening the resulting material. 

b, Primary loops form if a strand closes on itself 
without connecting to other chains. The presence 
of such loops reduces the strength of the material. 
Zhou et al.” report a method for quantifying the 
number of loops in the polymer network of a gel. 


loops had formed, different degradation prod- 
ucts (with their own characteristic molecular 
masses) would be obtained. By measuring the 
masses of the actual degradation products and 
comparing them to the predicted masses for 
the ideal network, the authors could there- 
fore determine the number of loops in the 
system. They named this technique ‘network 
disassembly spectrometry’ (NDS). 

A potential limitation of NDS is that one 
must be able to predict the degradation 
products in order to apply the technique. 
Nevertheless, a broad range of polymers and 
chemistries will be amenable to this ana- 
lytical approach. What’s more, the technique 
enables new fundamental studies to be made 
of the dynamics of polymer-network forma- 
tion. For example, by using NDS at different 
stages of their network’s formation, Zhou 
et al. were able to probe the structural evolu- 
tion of the gel. And by varying the reaction 
conditions of network assembly, they could 
control the fraction of loops that formed in 
the system, and thus gain valuable insight into 
the factors that contribute to the formation of 
these ‘defects. 

One application of NDS might be as a tool 
for correlating the microstructure of gels with 
their mechanical properties. This could be use- 
ful in the development of artificial muscles that 
perform useful work’. For example, some gels 
controllably expand and contract in response 
to external stimuli (such as changes in pH, 
illumination or heat)’ or internal chemical 
reactions’, and so can function as actuators 


that drive movement within a system. 

The rhythmic swelling and deswelling of 
gels has been harnessed to push micrometre- 
sized particles and cells along a surface’. 
Heat-responsive gels have also been used to 
drive micrometre-sized posts in and out ofa 
bath of solution’: when the system was hot, 
the gel shrank, pulling the posts out of the 
bath, but when the system cooled down, the 
gel expanded and pushed the posts back in. 
This behaviour formed the basis of a homeo- 
static device that autonomously regulated 
the temperature of the system. For optimal 
performance, such soft actuators need to be 
mechanically robust — which means that they 
should not contain wasted loops. 

On the other hand, because NDS can yield 
correlations between mechanical properties 
and the fraction of loops ina gel, it might help 
to provide design rules for exploiting loops, 
perhaps leading to the development of robust 
materials that nonetheless are highly flexible. 
In particular, there is rapidly growing inter- 
est in reconfigurable materials that can dra- 
matically change shape in response to external 
cues® '°, Examples abound in biology, because 
such adaptive behaviour is vital for survival — 
consider, for example, the ability of octopuses 
and cuttlefish to change their shape, colour and 
texture in order to camouflage themselves in 
the presence of predators. Because gels can be 
made to shrink and swell controllably, they can 
be driven to change shape, and thus are ideal 
synthetic materials for creating systems that 
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accomplish analogous adaptive behaviours. 

Advances in this area could lead to multi- 
functional systems that have one structure 
and function in one environment, but another 
shape and function under different condi- 
tions. However, to undergo significant shape 
changes, the material must be very flexible. It 
might be that the introduction of just the right 
amount of loops within a polymer network 
could lead to robust, but sufficiently flexible, 
shape-changing materials. With findings 
from NDS, perhaps researchers will eventu- 
ally establish effective routes for making gels 
that act like cuttlefish. m 
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How glaciers grow 


A state-of-the-art numerical model shows that the advance of glaciers ina 
cooling climate depends strongly on the pre-existing landscape, and that glacial 
erosion paves the way for greater glacial extent in the future. SEE LETTER P.206 


SIMON H. BROCKLEHURST 


he growth of glaciers reflects the balance 

between the accumulation of snow 

and its loss through melting. Given the 
close association between altitude and tem- 
perature’, the elevation and morphology of the 
valley floor on which a glacier forms are key 
determinants of a glacier’s size and longevity. 
Glacial erosion has carved the spectacular 
alpine landscapes of many mountain ranges, 
and these are characterized by extensive glacial 
valley floors at elevations close to the long-term 
snowline’. These landscapes contrast strongly 
with their precursors, which are generally 
steeper, narrower valleys sculpted by rivers 
(Fig. 1). On page 206 of this issue, Pedersen 
and Egholm’ use a state-of-the-art numeri- 
cal model of glacier dynamics to quantify, for 


the first time, the stark contrast in glacier 
development between modern alpine moun- 
tain ranges and those that came before them. 
They compare the legacy of numerous glacia- 
tions during the Pleistocene epoch (between 
about 2.5 million and 10,000 years ago) with 
the landscapes that would have been present 
at the onset of these glaciations. 

It has long been recognized that glaciers 
and their surroundings share an intimate, 
coupled relationship*”. Shaded aspects and 
large areas at high elevation promote glacier 
growth. In turn, glacial erosion strongly modi- 
fies the underlying landscape, widening valley 
floors and making hill slopes steeper. However, 
given the efficiency of glaciers at reworking 
and removing the evidence left by previous 
glacial cycles, direct insight into the relation- 
ship between glaciers and topography during 
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the initial phases of late Cenozoic glaciation 
(between about 2.5 million and 1 million years 
ago) has remained frustratingly elusive. 

Numerical modelling offers the opportu- 
nity to explore glacier growth during initial 
glaciation. Pedersen and Egholm demon- 
strate an almost linear relationship between 
the degree of climate cooling and ice volume 
when glaciers develop in fluvial landscapes. 
However, for landscapes previously occu- 
pied and sculpted by glaciers, the equiva- 
lent relationship is highly nonlinear. Once 
the climate cools sufficiently, glaciers that 
descend onto the wide, shallow valley floor 
carved by preceding glacial occupations 
expand markedly. 

Pedersen and Egholm also explore the full 
evolution of modern glacial landscapes by 
driving a simplified version of their numerical 
model — that is, without fluvial or hill-slope 
erosion, or active tectonics — with tempera- 
ture fluctuations representing the past 2 mil- 
lion years. The mid-Pleistocene transition 
(MPT) about 950,000 years ago marked a 
shift from 40,000-year glacial cycles accom- 
panied by symmetrical periods of cooling 
and warming to the protracted cooling and 
rapid warming of the more recent 100,000- 
year glacial cycles. The authors find much 
more extensive and erosive glaciers after 
the MPT than during the previous 1 million 
years. This result is consistent with geological 
evidence for accelerated glacial erosion after 
the MPT®’. However, as Pedersen and Egholm 
show, this accelerated erosion is not simply a 
consequence of the changing climate; faster 
glacial erosion post-MPT was preconditioned 
by the landscape modifications made by the 
preceding, smaller-scale glaciations. 

Pedersen and Egholm’ have taken on a 
considerable challenge. Numerical model- 
ling of landscape evolution resulting from 


Figure 1 | The morphological contrast between fluvial and glacial 
landscapes. As documented by Pedersen and Egholm’, the width and 
downstream gradient of a valley floor determine how glaciers would 
develop should the climate cool sufficiently. a, A fluvial landscape 
characterized by a narrow, steep valley: Taroko Valley, Hualien, Taiwan. 
Many of the world’s mountain ranges would have been dominated by fluvial 


% 


glacial erosion is a complex, multi-faceted 
problem. As such, the first models of glacial 
landscape evolution have emerged only within 
the past 15 years*. Egholm and co-authors” 
have led the field and achieved compellingly 
realistic results by introducing ice-dynamics 
formulations that are more appropriate to 
valley glaciers — which descend the rela- 
tively steep downstream gradient of the valley 
floor and interact with the valley sides —than 
earlier simplifications borrowed from ice-sheet 
models. 

Many considerations are involved in any 
attempt to numerically model glaciers and 
glacial erosion. The amount of snow that falls 
is strongly influenced by local climate, and is 
sensitive to changes in atmospheric moisture 
content, temperature and prevailing wind 
direction. Falling snow can be redistributed by 
wind” and avalanching before it reaches the 
glacier surface. Glacial-ice melt is influenced 
by temperature change, shading and rock 
fall from the valley sides. Glacial ice deforms 
under its own weight, and can also slide on its 
bed if liquid water is present. However, at this 
point, this generic model still only describes 
ice accumulating, deforming and melting. 
How would this moving ice (and water) erode 
the landscape? 

As was recently noted", models of glacial 
landscape evolution, including Pedersen and 
Egholms, still rely on simple, empirical rela- 
tionships between glacier sliding velocity and 
glacial erosion rate, supported by modest 
field-data sets. There is little clear connection 
between these numerical relationships and 
the quarrying of large bedrock blocks from 
the glacier bed, which in most circumstances 
represents the primary process of glacial 
erosion. The liquid water in the subglacial 
hydrological network is also known to have 
a key role in glacial erosion, but determining 
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erosion before the onset of glaciations about 2.5 million years ago. The first 
glaciers to develop in such fluvial landscapes will show an almost linear 
relationship between temperature change and subsequent ice volume. 

b, A glacial landscape: Yosemite Valley, California. Ice volume will increase 
dramatically if the temperature cools sufficiently for the glacier to occupy 
the broad, shallow valley floor. 


the appropriate numerical formulation for 
the role of subglacial water is in its infancy”. 
Further progress will require numerical models 
that are robustly supported by a combination 
of careful observations of modern glaciers” 
and thermochronological evidence for how 
glacial landscapes have evolved over the 
longer timescales’ that are explored numeri- 
cally here. It will also require driving models 
with more-realistic climate representations’”"* 
than those currently considered, and will 
probably depend on continuing advances in 
computer power. m 
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Post-replicative repair involves separase- dependent 
removal of the kleisin subunit of cohesin 


Alexandra McAleenan', Andres Clemente-Blanco!, Violeta Cordon-Preciado!, Nicholas Sen', Miguel Esteras!, Adam Jarmuz! 


& Luis Aragon! 


DNA double-strand break repair is critical for cell viability and 
involves highly coordinated pathways to restore DNA integrity at 
the lesion. An early event during homology-dependent repair is 
resection of the break to generate progressively longer 3’ single- 
strand tails that are used to identify suitable templates for repair’. 
Sister chromatids provide near-perfect sequence homology and are 
therefore the preferred templates during homologous recombina- 
tion’. To provide a bias for the use of sisters as donors, cohesin— 
the complex that tethers sister chromatids together*—is recruited 
to the break”* to enforce physical proximity. Here we show that 
DNA breaks promote dissociation of cohesin loaded during the 
previous S phase in budding yeast, and that damage-induced dis- 
sociation of cohesin requires separase, the protease that dissolves 
cohesion in anaphase’. Moreover, a separase-resistant allele of 
the gene coding for the a-kKleisin subunit of cohesin, Mcd1 (also 
known as Sccl), reduces double-strand break resection and com- 
promises the efficiency of repair even when loaded during DNA 
damage. We conclude that post-replicative DNA repair involves 
cohesin dissociation by separase to promote accessibility to repair 
factors during the coordinated cellular response to restore DNA 
integrity. 

DNA double-strand breaks (DSBs), in which both strands of the DNA 
double helix are severed, are very hazardous as they can lead to genome 
rearrangements. In budding yeast, DSBs occurring post-replication are 
repaired by homologous recombination using sister chromatids as 
the template’. To facilitate homologous recombination between sister 
chromatids, the DNA-damage checkpoint promotes their physical 
proximity by allowing cohesion re-establishment***”. In addition, the 
proximity between sister chromatids is also reinforced at the sites of 
damage through the formation of a de novo cohesin domain at breaks**. 

Cohesin complexes that are established in S phase and hold sister 
chromatids together have an unknown fate during the repair process. 
First, we wanted to know whether S-phase cohesin maintains cohe- 
sion after a DSB in G2 phase. An assay to measure damage-induced 
cohesion’—based on a green fluorescent protein (GFP) chromosome 
tag flanked by HO endonuclease sites that produce a fragment after 
DSB formation where cohesion can be analysed microscopically 
(Fig. la)—was modified to contain an MCD1 gene regulated by the 
methionine promoter (MET). Passage through S phase in methionine- 
depleted media at 37 °C (to inactivate the mcd1-1 allele) ensured that 
cohesion was established only by MCD1 expressed from the MET 
promoter (Fig. la). Cells were blocked in metaphase before the 
expression of HO endonuclease (Fig. 1a). An increase in cohesion loss 
for the chromatin fragment was observed when MCD1 transcription 
was repressed in metaphase after DSB induction (Fig. 1b). Therefore 
S-phase cohesin is unable to maintain cohesion of the fragment after a 
DSB. Next, we followed the dynamics of S-phase cohesin by western 
analysis. After establishment of cohesion, cells were blocked in meta- 
phase and MCD1 transcription was repressed with methionine. Upon 
DSB induction full-length Mcd1 disappeared (Fig. 1c); by contrast, a 


stable pool of Mcd1 was retained when the DSB was not induced. We 
conclude that the levels of Mcd1 loaded during S phase are markedly 
reduced as a consequence of DSB induction. 

Next we asked whether constant transcription of MCD1 is required 
post DSB formation to maintain cohesion. We substituted MET for 
the GAL1-10 promoter in the strain with the GFP tags described above’. 
Expression of both HO and MCD1, promoted by galactose, was induced 
in metaphase cells (Supplementary Fig. 1). After DNA-damage-induced 
cohesion (Supplementary Fig. 1; cohesion establishment), cells were 
either maintained in galactose or transferred to glucose, which represses 
GALI-10. Deletion of silent HM cassettes (donor sequences for the 
repair) in the strain ensured the persistence of the breaks in glucose. 
Cells transferred to glucose after cohesion establishment exhibited cohe- 
sion loss (Supplementary Fig. 1; glucose) compared with cells kept in 
galactose (Supplementary Fig. 1; galactose). Similar results were 
obtained with chromosome tags located on undamaged chromosomes 
(Supplementary Fig. 1). Moreover, Mcd1 binding dynamics around a 
MAT DSB after transient MCD1 expression showed a reduction in 
binding upon transcriptional repression by glucose addition (Fig. 1d; 
glucose). Therefore an unused pool of Mcd1 is important in maintaining 
cohesion for extended periods in the presence of DNA damage. These 
results are consistent with a recent report demonstrating that DNA 
damage requires higher levels of cellular Mcd1 than cohesion establish- 
ment during S phase”. 

Next we sought to determine how Mcd1 dissociation is triggered. 
First, we considered Mcd1 de-acetylation, because Ecol-mediated 
acetylation promotes damage-induced cohesion''. We ruled out this 
possibility when we observed that an acetyl-mimicking allele coding 
for Mcd1 (Mcd1(K84/Q210)) lost cohesion at DSB-proximal regions 
after transcription repression (Supplementary Fig. 2). Next, we con- 
sidered that separase might be required for the dissociation of cohesin 
after DNA damage, as in fission yeast, separase is required for DNA 
repair, and cleavage of Rad21, the Mcd1 homologue, has been reported 
in cells exposed to DNA-damage agents’. 

We used the assay described earlier (Fig. 1b) to test whether the 
reduction of S-phase cohesin levels requires separase. A tight con- 
ditional allele of yeast separase known as esp1-C113 (ref. 13) was used 
to test this possibility. Unlike wild-type cells, esp1-C113 cells main- 
tained a stable pool of Mcd1 after DSB induction and MCD1 tran- 
scriptional repression (Fig. 2a). Therefore, Mcd1 dissociation requires 
separase. The separase cleavage sites in Mcd1 have been mapped to 
arginine residues located at positions 180 and 268 (ref. 7). We mutated 
these residues to aspartic acid to generate an Mcd1 allele resistant to 
separase cleavage, mcd1(RRDD)’. Consistent with a role for separase in 
the dissociation of Mcd1, we found that transient expression of 
mcd1(RRDD) reduced both the loss of Mcd1 from DSB-proximal 
regions (Fig. 2b) and of damage-induced cohesion (Fig. 2c) observed 
for wild-type MCD1 after transcription shutdown (Fig. Ic, d). 

DNA damage causes hypermodification of Mcd1 (ref. 12), possibly by 
Chk1-dependent phosphorylation of Mcd1 serine residue 83 (ref. 14). 


1Cell Cycle Group, MRC Clinical Sciences Centre, Imperial College, Du Cane Road, London W12 ONN, UK. 
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Figure 1 | S-phase-loaded cohesin is not sufficient to provide cohesion of a 
chromatin fragment in the presence of DNA damage. a, Diagrammatic 
representation of the modified assay’ used to measure cohesivity of S-phase 
cohesin after DSB induction. Galactose was added to promote DSBs. HO“, HO 
cut site. b, Percentage of cells showing loss of cohesion is shown (mean ~ s.e.m., 

n = 3), when incubated with or without methionine to repress further MCD1 
transcription. c, Western blot analysis of Mcd1 stability in cells containing the 
GAL-HO system, MET-MCD1 attached to nine copies of the myc epitope tag 
(MET-MCD1-9myc) and GAL-MCD1 attached to three copies of the 


We did not observe any change in mobility for S-phase-loaded Mcd1 
when DSBs were induced in metaphase-arrested cells (Figs 1b and 2a). 
However, when Mcd1 was expressed simultaneously with HO endonu- 
clease (generating DSBs), a slower migrating form of Mcd1 was observed 
(Supplementary Fig. 3). Phosphatase treatment demonstrated that this 
mobility change was due to protein phosphorylation (Supplementary 
Fig. 4). Interestingly, phosphorylated Mcd1 seemed more resistant to 
dissociation (Supplementary Fig. 3) and accumulated when separase- 
resistant Mcd1 was expressed simultaneously with the induction of a 
break (Supplementary Fig. 3). We also could detect what appeared to 
be a fragment of the carboxy-terminal region of Mcd1 (Supplementary 
Fig. 3), consistent with separase cleavage. On the basis of our findings we 
propose that in the presence of post-replicative DNA DSBs, S-phase and 
damage-induced Mcd1 can dissociate from DSB-proximal and cohe- 
sion-association regions in a separase-dependent manner. We speculate 
that this dissociation must be highly regulated and locally coordinated to 
prevent the full separation of the genome at any one time. 
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Chromosome 5 
coordinates 


SB site (kb) 


haemagglutinin epitope tag (GAL-MCD1-3HA). d, Chromatin- 
immunoprecipitation (ChIP) analysis of Mcd1-3HA binding around HO- 
induced DSB at the MAT locus in cells with transient expression of Mcd1. 
Growth at 30 °C in raffinose media (uncut sample), followed by galactose 
addition for 3 h (expressing HO and Mcd1-3HA) and transfer to glucose media 
or maintenance on galactose. ACT1 locus on chromosome 6 is shown for 
comparison (mean + s.e.m., 1 = 3). The 556-kb position is a cohesin- 
association region. 


Nevertheless, even a localized dissociation of S-phase Mcd1 in cells 
experiencing post-replicative DNA damage is surprising, as cohesion 
is thought to be required for accurate repair. It is possible however, that 
local cohesion removal is necessary for repair. DSB repair by homo- 
logous recombination initiates with 5’-3’ resection of the break to 
yield 3' single-stranded DNA tails’. In budding yeast, resection is 
initiated by the MRX (Mrel1-Rad50-Xrs2) complex’*”®, and more 
extensive resection is accomplished by the combined activities of the 
Sgs1-Top3-Rmil complex and Dna? (ref. 16) and/or the exonuclease 
Exol (ref. 17). The MRX subunit Mre11 localized to DSBs when cohe- 
sin dissociation was blocked in cells expressing the Mcd1 separase- 
resistant allele (Supplementary Fig. 5); however, Sgs1 recruitment was 
reduced in these cells (Fig. 3a), raising the possibility that break resec- 
tion might be slower or inefficient in the absence of Mcd1 dissociation. 
To evaluate this, we compared break-resection dynamics in cells 
expressing wild-type or separase-resistant Mcd1 (Fig. 3b). After syn- 
chronous HO-induced cleavage, the rate of resection was evaluated 
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Figure 2 | Mcd1 dissociation in the presence of DNA damage requires 
separase. a, Western blot analysis of Mcd1 stability in wild-type and esp1-C113 
cells carrying the GAL-HO system and MET-MCD1-9myc. Coomassie stain 
shows equal loading on the blots. b, ChIP analysis of Mcd1(RRDD)-3HA 
binding around the HO-induced DSB at the MAT locus in cells during the 
growth conditions described in Fig. 1d, before DSB (uncut), after the addition of 
galactose for 5 h (galactose) and after the addition of galactose for 3 h followed 
by transfer to glucose for 2h (glucose). Binding at the ACT1 locus on 


using probes at different distances from the HO break. As the 5’ strand 
is degraded, restriction enzymes are unable to cleave single-stranded 
DNA, and the intensity of the bands corresponding to the DNA frag- 
ments by Southern blot hybridization becomes diminished. Resection 
in cells expressing separase-resistant Mcd1 was attenuated (Fig. 3b and 
Supplementary Fig. 6). A similar result was found when resection 
intermediates were analysed as transient smears of higher molecular 
weight (Supplementary Fig. 7). These results demonstrate that, in the 
absence of Mcd1 dissociation, cohesin complexes can interfere with 
resection of 5’ termini. 
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Chromosome 3 
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mcd1-1 mcd1-1 


chromosome 6 is shown for comparison (mean = s.e.m., n = 3). Pulse-field gel 
electrophoresis to confirm DSB occurrence is shown. ¢, Percentage of mcd1-1, 
mced1-1/GAL-MCD1 and mcd1-1/GAL-mcd1(RRDD) cells showing loss of 
cohesion (as described in Fig. 1d,) after 2h of galactose addition (cohesion 
establishment), after 3.5 h in galactose (galactose; MCD1 expressed) and after 
2h in galactose followed by 1.5h in glucose (glucose; MCD1 repressed) 
(mean = s.e.m, n = 4) (left). Pulse-field gel electrophoresis to confirm DSB 
occurrence is shown (right). 


Next, we tested whether blocking Mcd1 dissociation impairs 
DNA-repair events that require extended end resection. To this aim, 
we used a yeast strain engineered to contain a 117-base-pair HO- 
endonuclease-recognition sequence within the LEU2 gene on chro- 
mosome 3, and carrying homology with only the right end of the break 
(the ‘U2’ end) 25kilobases (kb) away'*’? (Supplementary Fig. 8). 
Repair largely occurs through single-strand annealing’’, but also 
can occur by Rad51-dependent intra-chromosomal break-induced 
replication (Supplementary Fig. 8). Cells expressing separase-resistant 
Mcd1 exhibited a reduction in repair product (Fig. 4a), demonstrating 
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Figure 3 | Mcd1 dissociation facilitates 5’-3' end resection. a, ChIP analysis 
of Sgs1 binding around the HO-induced DSB at the MAT locus in cells 
expressing wild-type Mcd1 or Mcd1(RRDD). Cells carrying SGS1-3HA under 
the endogenous promoter and the HO endonuclease system together with either 
GAL-MCD1 or GAL-mcd1(RRDD) were grown at 30 °C in raffinose media 
(uncut), and expression of HO and either MCD1 or mcd1(RRDD) was induced 


that Mcd1 dissociation is important for efficient repair of the break 
in this assay. Expression of separase-resistant Mcd1 in a rad514 
derivative strain, in which break-induced replication is inactive, 
also exhibited reduced repair kinetics (Supplementary Fig. 8). 
Moreover, analysis of MAT-switching kinetics—which does not 
involve extensive resection—in cells expressing separase-resistant 
Mcd1 showed that although switching was efficiently achieved, 
the formation of recombinant product was delayed compared with 
wild-type Mcdl (Supplementary Fig. 9). These data demonstrate 
that local cohesin dissociation facilitates the timely repair of DNA 
DSBs. 
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for 3 h. Binding at the ACT1 locus on chromosome 6 is shown for comparison 
(mean = s.e.m., n = 3). b, Plots show quantification of band intensities 
corresponding to each probe used over time on Southern blot analysis 
measuring strand resection in wild-type cells expressing an empty vector 
(MCD1; WT), GAL-MCD1 or GAL-mcd1(RRDD) (mean = s.e.m., n = 3). 


On the basis of our results we propose that upon DSB formation 
post-replication, cohesin complexes loaded during S phase, and to 
some extent those loaded de novo at DSB-proximal regions, dissociate 
from chromosomes (Supplementary Fig. 10). Importantly, cohesin 
dissociation involves separase, hence preventing the use of dissociated 
Mcd1 in a new round of cohesion generation. The functional role of 
cohesin dissociation is to allow local accessibility for repair activities, 
such as resection, at sites of damage (Supplementary Fig. 10). Separase 
activity must be tightly regulated during DNA repair, perhaps with 
targeted or attenuated proteolytic activity (compared to anaphase), to 
ensure context-dependent cleavage of Mcd1. The detailed mechanisms 
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Figure 4 | Mcd1 dissociation facilitates repair of DNA DSB. a, Top, 
Southern blot showing kinetics of repair in strains carrying the donor U2 
sequence 25 kb away from the HO cut site and expressing either a wild-type 
copy of Mcd1 under the GAL1-10 promoter (GAL-MCD1) or a non-cleavable 
mutant of Mcd1 also under GAL1-10 promoter control (GAL-mcd1(RRDD)). 
Cells were first grown at 30 °C in raffinose media (uncut) before expression of 
HO and either MCD1 or mcd1(RRDD) by addition of galactose was induced. 
Samples were collected at the indicated time points and DNA was digested with 
KpnI and probed with a U2-specific probe. The bands representing donor, 
uncut, product and cut species are indicated. Bottom, plots show quantification 
of band intensities corresponding to each band (donor, uncut, product and cut) 
over time (mean + s.e.m., 1 = 3). 


of this regulation are a fascinating question for the future. Interesting- 
ly, if cohesin dissociation is conserved in mammalian models, thera- 
peutic approaches that rely on the generation of DNA lesions could be 
improved by simultaneously inhibiting the cohesin cycle. 


METHODS SUMMARY 

Induction of DSBs and shut-off assays. DSBs were induced in strains containing 
stably integrated GAL1-10::HO and MCD1-3(or -6)::HA alleles, either on centro- 
meric plasmids (carrying the LEU2 marker) or stably integrated. Cells were grown 
in YEP media with 2% lactic acid or 2% raffinose. Galactose (Sigma, 20% w/v 
stock) was added to the culture at a final concentration of 2% w/v. For ChIP 
experiments cells were incubated in galactose, to induce alleles coding for HO 
and Mcdl, for 4h at 30°C, and the culture was then split in two. One half was 
maintained in 2% galactose, whereas the remainder of the culture was incubated in 
2% glucose. Samples were collected at different time points (as indicated in figure 
legends) and processed for ChIP. For experiments to analyse damage-induced 
cohesion, cells were arrested in metaphase with 15 pg ml! nocodazole (1:100 
dilution from 1.5 mg ml ! in dimethylsulphoxide (DMSO)) before addition of 
galactose to induce HO and Mcd1 expression, and were incubated for 1h at 
30 °C, then transferred to 37.5 °C to inactivate mcd1-1. After incubation a second 
dose of nocodazole was added (15 mg ml ') to prevent anaphase entry. One half of 
the culture was then maintained in 2% galactose, whereas the remainder of the 
culture was incubated in 2% glucose. Samples were collected at different time 
points (as indicated in figure legends) and processed for microscopy. For extended 
arrests in metaphase a second dose of nocodazole was added after 3h to prevent 
cells entering anaphase. 
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DSB resection and SSA assays. The restriction enzyme used for 5’-3' resection 
analysis was Styl (New England Biolabs). The digested fragments were resolved on 
agarose gels and transferred to GeneScreen Plus hybridization membrane either 
by vacuum or capillary wet transfer. The DNA probes used for Southern blotting 
were prepared by Amersham DNA labelling beads (—dCTP) and purified by 
Amersham G-50 Micro Columns. For single-strand annealing experiments the 
restriction enzyme used was KpnI. 


Full Methods and any associated references are available in the online version of 
the paper. 


Received 9 December 2011; accepted 28 September 2012. 
Published online 25 November 2012. 


1. Paques, F. & Haber, J. E. Multiple pathways of recombination induced by double- 
strand breaks in Saccharomyces cerevisiae. Microbiol. Mol. Biol. Rev. 63, 349-404 
(1999). 

Kadyk, L. C. & Hartwell, L. H. Sister chromatids are preferred over homologs as 

substrates for recombinational repair in Saccharomyces cerevisiae. Genetics 132, 

387-402 (1992). 

3. Johnson,R.D.&Jasin, M. Sister chromatid gene conversion is a prominent double- 
strand break repair pathway in mammalian cells. EMBOJ. 19, 3398-3407 (2000). 

4. Nasmyth, K. Cohesin: a catenase with separate entry and exit gates? Nature Cell 
Biol, 13, 1170-1177 (2011). 

Unal, E. etal. DNA damage response pathway uses histone modification to assemble 
a double-strand break-specific cohesin domain. Mol. Cell 16, 991-1002 (2004). 

6. Strom, L, Lindroos, H. B., Shirahige, K. & Sjogren, C. Postreplicative recruitment of 
cohesin to double-strand breaks is required for DNA repair. Mol. Cel! 16, 
1003-1015 (2004). 

7. Uhlmann, F., Lottspeich, F. & Nasmyth, K. Sister-chromatid separation atanaphase 
onset is promoted by cleavage of the cohesin subunit Sccl. Nature 400, 37-42 
(1999). 

8. Strom, L. et al. Postreplicative formation of cohesion is required for repair and 
induced by a single DNA break. Science 317, 242-245 (2007). 

9. Unal, E., Heidinger-Pauli, J. M. & Koshland, D. DNA double-strand breaks trigger 
genome-wide sister-chromatid cohesion through Ecol (Ctf7). Science 317, 
245-248 (2007). 

10. Heidinger-Pauli, J.M., Mert, O., Davenport, C., Guacci, V. & Koshland, D. Systematic 

reduction of cohesin differentially affects chromosome segregation, 

condensation, and DNA repair. Curr. Biol. 20, 957-963 (2010). 

11. Heidinger-Pauli, J. M., Unal, E. & Koshland, D. Distinct targets of the Ecol 

acetyltransferase modulate cohesion in S phase and in response to DNA damage. 

Mol. Cell 34, 311-321 (2009). 

12. Nagao, K., Adachi, Y. & Yanagida, M. Separase-mediated cleavage of cohesin at 

interphase is required for DNA repair. Nature 430, 1044-1048 (2004). 

13. Baskerville, C., Segal, M. & Reed, S. |. The protease activity of yeast separase (esp 1) 

is required for anaphase spindle elongation independently of its role in cleavage of 

cohesin. Genetics 178, 2361-2372 (2008). 

14. Heidinger-Pauli, J. M., Unal, E., Guacci, V. & Koshland, D. The kleisin subunit of 

cohesin dictates damage-induced cohesion. Mol. Cell 31, 47-56 (2008). 

15. Zhu,Z., Chung, W.H.,Shim,E.Y., Lee, S. E. & lra,G. Sgs1 helicase and two nucleases 

Dna2 and Exol resect DNA double-strand break ends. Cell 134, 981-994 (2008). 

16. Mimitou, E. P. & Symington, L. S. Sae2, Exol and Sgs1 collaborate in DNA double- 

strand break processing. Nature 455, 770-774 (2008). 

17. Shim, E. Y. et al. Saccharomyces cerevisiae Mrel1/Rad50/Xrs2 and Ku proteins 

regulate association of Exol and Dna2 with DNA breaks. EMBO J. 29, 3370-3380 

(2010). 

18. Vaze, M. B. et al. Recovery from checkpoint-mediated arrest after repair of a 

double-strand break requires Srs2 helicase. Mol. Cell 10, 373-385 (2002). 

19. Jain, S. et al. A recombination execution checkpoint regulates the choice of 
homologous recombination pathway during DNA double-strand break repair. 
Genes Dev. 23, 291-303 (2009). 


N 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We would like to thank D. Koshland, E. Unal, J. Haber, G. lra and 
K. Nasmyth for providing valuable strains. We thank members of the Aragon laboratory, 
P. Ullal, S. Farmer, K. Pegram, J. Torres-Rosell and M. Merkenschlager for discussions 
and reading of the manuscript. This work was funded by the Intramural Research 

Program of the Medical Research Council UK. 


Author Contributions A.M. made the original observation that transcriptional 
repression of MCD1 in the presence of DNA damage causes defects in sister 
chromatid cohesion. A.M. and A.C.-B. performed all ChIP analysis. A.M., A.C.-B., N.S. and 
V.C.-P. performed microscopic analysis of cohesion. A.C.-B. and M.E. performed 
physical assays to measure resection and repair. A.M. and A.C.-B. performed 
biochemical assays to measure Mcd1 stability. AJ. provided technical support. LA. 
wrote the paper, and all authors analysed the data, discussed the results and 
commented on the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to LA. (luis.aragon@csc.mrc.ac.uk). 


©2013 Macmillan Publishers Limited. All rights reserved 


METHODS 

Yeast strains and DNA constructs. Strains used in this study are isogenic to 
BY4741 (MATa, his3A1, leu2A0, met15A0, ura3A0) or W303 (MATa, can1-100, 
leu2-3, his3-11, trp1-1, ura3-1, ade2-1), except for the strains with the HO system 
(obtained from J. Haber). Strains used are listed in Supplementary Table 1. All 
DNA constructions were done using Gene-synthesis service from GeneCust. 
Yeast media and cell cycle synchronization. For G1 arrests, «-factor was added 
to exponentially growing MATa cultures (D¢o0 nm ~0.5) to a final concentration 
of 1.510 °M for bar1A strains and 3 X 10 °M for BARI strains. Cultures 
were monitored by microscopy until =90% of cells were arrested in G1 (typically 
2-3 h). To inactivate temperature-sensitive alleles in G1, cells were shifted to 37 °C 
for 30 min before release. To release cells, the culture was spun (4,000 r.p.m., 
2 min) and the pellet washed three times in a large volume of pre-warmed 
medium containing 0.1 mgml | pronase. The pellet was then re-suspended in 
fresh medium and samples taken at the indicated times. Nocodazole was used to 
arrest cells in G2/M (1.5 mg ml ? stocks in dimethylsulphoxide (DMSO)). To 
repress transcription from the methionine promoter we added 5 mM methionine 
to the cultures. 

Induction of DSBs and shut-off assays. DSBs were induced in strains containing 
stably integrated GAL1-10::HO and MCD1-3(or -6)::HA alleles, either on centro- 
meric plasmids (carrying the LEU2 marker) or stably integrated. Cells were grown 
in YEP media with 2% lactic acid or 2% raffinose. Galactose (Sigma, 20% w/v 
stock) was added to the culture at a final concentration of 2% w/v. For ChIP 
experiments cells were incubated in galactose to induce alleles coding for HO 
and Mcd1 for 4h at 30 °C, the culture was then split in two. One half was main- 
tained in 2% galactose whereas the remainder of the culture was incubated in 2% 
glucose. Samples were collected at different time points (as indicated in figure 
legends) and processed for ChIP. For experiments to analyse damage-induced 
cohesion, cells were arrested in metaphase with 15 pg ml! nocodazole (1:100 
dilution from 1.5mgml * in DMSO) before addition of galactose to induce HO 
and Mcd1 expression, and were incubated for 1h at 30°C, then transferred to 
37.5°C to inactivate mcd1-1. After incubation a second dose of nocodazole was 
added (15 j1gml') to prevent anaphase entry. One half of the culture was then 
maintained in 2% galactose whereas the remainder of the culture was incubated in 
2% glucose. Samples were collected at different time points (as indicated in figure 
legends) and processed for microscopy. For extended arrests in metaphase a 
second dose of nocodazole was added after 3 h to prevent cells entering anaphase. 
Microscopy. For microscopy, series of Z-focal plane images were collected on a 
Leica IRB microscope using a Hamamatsu 742-95 digital camera and OpenLab 
software (Improvision). A tunable light source (Polychrome IV (Photonics)) with 
a xenon lamp or an ultraviolet mercury lamp (Leica) was used. Images in different 
z axis planes were flattened into a two-dimensional projection and processed in 
OpenLab. To visualize the nuclei of intact cells, cells were re-suspended in a final 
concentration of 1% Triton X-100 and 25ngml~' DAPI/Antifade. For each 
experimental condition, at least 200 cells were scored for GFP spots. 

ChIP. For ChIP analysis, 50 ml of culture (at Dgoo nm ~0.5) was fixed with form- 
aldehyde (final concentration, 1.42%) for 15min at room temperature (20°C). 
The formaldehyde was quenched with glycine (final concentration, 125 mM) for 
5 min at room temperature and the cells collected by centrifugation (4,000 r.p.m., 
2 min). The pellet was washed with PBS buffer, transferred to a screw-cap tube and 
snap-frozen in liquid nitrogen. Pellets were stored at —80°C. Pellets were re- 
suspended in 100 pl IP buffer (150 mM NaCl, 50mM Tris-HCl, pH7.5, 5mM 
EDTA, NP-40 (0.5% v/v), Triton X-100 (1.0% v/v)) containing phenylmethylsul- 
phony] fluoride (PMSF) (final concentration, 1 mM) and complete protease inhi- 
bitor cocktail (Roche), and 500 ul glass beads were added. Cells were lysed by two 
20-s cycles on power setting 6.5 in a FastPrep FP120 (BIO 101) machine, with 
5 min on ice in between cycles. 300 ul IP buffer containing PMSF and protease 
inhibitors was then added. Tubes were pierced with a hot needle and placed on top 
of fresh eppendorfs and spun (1,000 r.p.m., 2 min) to collect lysate minus glass 
beads. Samples were pelleted (13,000 r.p.m., 1 min, 4°C). The nuclear pellet was 
re-suspended thoroughly in 1 ml IP buffer containing PMSF and protease inhibi- 
tors. The chromatin was sonicated for 1h (15s on, 15 s off at high power at 4 °C) 
(Diagenode Bioruptor). At this point, sonicated chromatin could be stored at 
—80°C. 120 ul (30% of sample) was taken for input and 400 pl for IP. After 
clarification (13,000 r.p.m., 1 min) input DNA was precipitated from the super- 
natant by the addition of KaOAc to a final concentration of 0.3M and 2.75 
volumes of ethanol. The mixture was incubated at —20°C for 2h or overnight, 
and then spun (13,000 r.p.m., 5 min) and the supernatant discarded. 100 pl 10% 


LETTER 


(w/v) Chelex 100 suspension was added to the dried pellet and the sample boiled 
for 20 min. The supernatant was cleaned using the PCR purification kit (Qiagen) 
according to the manufacturer’s instructions, and the DNA eluted in 250 ul water. 
DNA was stored at —20 °C. 2 ug anti-HA antibody (Roche) was added to 400 pl 
chromatin for the IP, and the samples were incubated in an ultrasonic water bath 
for 30 min at 4 °C. After clarification (13, 000 r.p.m., 2 min, 4°C) the supernatant 
was added to 100 jl of a 50:50 slurry of Protein A and Protein G beads (Roche), 
which had been pre-equilibrated in IP buffer. The sample and beads were incu- 
bated for 2h or overnight on a rotating platform at 4°C. The beads were then 
washed three times in IP buffer, and then 250 pl 10% (w/v) Chelex 100 suspension 
added and the sample boiled for 20 min. After spinning down (13,000 r.p.m., 
1 min) the supernatant was transferred into a fresh tube and stored at —20 °C. 
Real-time PCR. PCR reactions were performed using the Sensimix NoRef Kit 
(Quantance). Reactions were carried out according to the manufacturer’s instruc- 
tions in a total volume of 20 pl containing 3 jl input or IP DNA and oligonucleo- 
tide primer pairs (final concentration, 1.5 tM). Amplification was performed in a 
DNA Engine Opticon 2 thermal cycler and analysed using Opticon software (MJ 
Research) or in a Bio-Rad C1000 Thermal Cycler in conjunction with the Bio-Rad 
CFX96 Real-time-system and analysed using CFX manager (Bio-Rad). The pri- 
mers used for the analysis of an HO break at the MAT locus are shown in 
Supplementary Table 2. 

Sample preparation and western blot analysis. Samples were prepared by tri- 
chloroacetic acid (TCA) extraction. Extracts were prepared as follows: cells were 
collected by centrifugation (4,000 r.p.m., 2 min) and washed with 20% TCA. The 
TCA was aspirated and the pellets frozen at —80°C. All of the following puri- 
fication steps were performed on ice with pre-chilled solutions. Cells were re- 
suspended in 250 ull 20% TCA, glass beads were added and the cells broken by 
one 40-s cycle at power setting 5.5 in a FastPrep FP120 (BIO 101) machine. Tubes 
were pierced with a hot needle and placed onto of fresh eppendorfs and spun 
(1,000 r.p.m., 2 min) to collect lysate minus glass beads. The glass beads were 
washed with 1 ml 5% TCA and this was added to the lysate, and mixed by pipet- 
ting. The precipitated proteins were collected by centrifugation (14,000 r.p.m., 
10 min, 4°C) and then pellets were washed with 750 pl 100% ethanol. Proteins 
were solubilized in 50 ul 1M Tris, pH 8, and 100 ul (x2) SDS-PAGE loading 
buffer (60mM Tris, pH 6.8, 2% SDS, 10% glycerol, 0.2% bromophenol blue) 
and boiled for 5 min at 95 °C. Insoluble material was removed by centrifugation 
(14,000 r.p.m., 5 min, room temperature) and the supernatant either stored at 
—20 °C or loaded immediately onto an SDS-PAGE mini-gel. Samples were either 
run on an 8% acrylamide gel in Tris-Glycine SDS running buffer using the Bio-Rad 
Mini-PROTEAN 3 system or on Pre-Cast 4—12% Bis-Tris gels in NuPAGE MOPS 
(all Invitrogen). SDS-PAGE gels were transferred to polyvinylidene fluoride trans- 
fer membrane (Hybond-P, Amersham Biosciences) in either the Bio-Rad Mini 
Trans-Blot Electrophoretic Transfer Cell or by using the XCell SureLock Mini Cell 
transfer module (Invitrogen). The Bio-Rad system was used in conjunction with 
Tris-Glycine blotting buffer (National Diagnostics) containing 20% methanol, and 
run for 1h at 200 V or overnight at 30 V. The NuPAGE system was used with 
NuPAGE transfer buffer containing 20% methanol and run for 1h at 30V. 
Membranes were blocked in 5% skimmed-milk powder in PBS with 0.1% 
Tween 20 (PBS-T) for 1h or overnight at 4°C, then incubated with the anti-HA 
IgG1 antibody 12CA5 (Roche) at a 1/5,000 dilution in blocking solution for 
between 1 h at room temperature to overnight at 4°C. After several washes in 
PBS-T, membranes were incubated with the sheep anti-mouse IgG horseradish 
peroxidase-linked antibody (GE Healthcare) at a 1/10,000 dilution in blocking 
solution. After several further washes in PBS-T, membranes were incubated with 
the ECL Plus Western Blotting Detection system (GE Healthcare), followed by 
exposure to ECL Hyperfilm (GE Healthcare) to detect the secondary antibody. 
Pulse-field gel electrophoresis. Pulse-field gel electrophoresis was performed by 
running genomic DNA embedded in plugs of 1% agarose, at 200 V, for 40 h (initial 
time, 10s; final time, 35s). DNA plugs were made using a Bio-Rad CHEF yeast 
genomic DNA plug kit, according to the manufacturer’s protocol. 

DSB resection and single-strand annealing assays. Genomic DNA was prepared 
as described. The restriction enzyme used for 5'-3' resection analysis was Styl 
(New England Biolabs). The digested fragments were resolved on agarose gels and 
transferred to GeneScreen Plus hybridization membrane either by vacuum or by 
capillary wet transfer. The DNA probes used for Southern blotting were prepared 
by Amersham DNA-labelling beads (—dCTP) and purified by Amersham G-50 
Micro Columns. For single-strand annealing experiments the restriction enzyme 
used was Kpnl. The experiment was performed as described in refs 18 and 19. 
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Control of somatic tissue differentiation by the long 


non-coding RNA TINCR 


Markus Kretz', Zurab Siprashvili*, Ci Chu', Dan E. Webster, Ashley Zehnder', Kun Qu', Carolyn S. Lee!, Ross J. Flockhart’, 
Abigail F. Groff', Jennifer Chow', Danielle Johnston!, Grace E. Kim!, Robert C. Spitale’, Ryan A. Flynn’, Grace X. Y. Zheng’, 
Subhadra Aiyer?, Arjun Raj’, John L. Rinn*, Howard Y. Chang!* & Paul A. Khavari>® 


Several of the thousands of human long non-coding RNAs 
(IncRNAs) have been functionally characterized’ *; however, 
potential roles for IncRNAs in somatic tissue differentiation 
remain poorly understood. Here we show that a 3.7-kilobase 
IncRNA, terminal differentiation-induced ncRNA (TINCR), con- 
trols human epidermal differentiation by a post-transcriptional 
mechanism. TINCR is required for high messenger RNA abun- 
dance of key differentiation genes, many of which are mutated in 
human skin diseases, including FLG, LOR, ALOXE3, ALOX12B, 
ABCA12, CASP14 and ELOVL3. TINCR-deficient epidermis 
lacked terminal differentiation ultrastructure, including kerato- 
hyalin granules and intact lamellar bodies. Genome-scale RNA 
interactome analysis revealed that TINCR interacts with a range 
of differentiation mRNAs. TINCR-mRNA interaction occurs 
through a 25-nucleotide “TINCR box’ motif that is strongly 
enriched in interacting mRNAs and required for TINCR binding. 
A high-throughput screen to analyse TINCR binding capacity to 
approximately 9,400 human recombinant proteins revealed direct 
binding of TINCR RNA to the staufen1 (STAU1) protein. STAU1- 
deficient tissue recapitulated the impaired differentiation seen 
with TINCR depletion. Loss of UPF1 and UPF2, both of which 
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Figure 1 | TINCR is induced during epidermal differentiation. a, Mean- 
centred, hierarchical clustering of 258 annotated non-coding RNAs altered 
(>twofold change) in undifferentiated cells (day 0) and during days of calcium- 
induced differentiation in vitro. b, Schematic of TINCR genomic locus on 
chromosome 19. Day 0, 3 and 6 of keratinocyte (KC) differentiation; blue 
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are required for STAU1-mediated RNA decay, however, did not 
have differentiation effects. Instead, the TINCR-STAU1 complex 
seems to mediate stabilization of differentiation mRNAs, such as 
KRT80. These data identify TINCR as a key IncRNA required for 
somatic tissue differentiation, which occurs through IncRNA bind- 
ing to differentiation mRNAs to ensure their expression. 
LncRNAs regulate a variety of processes'*°, yet their effects on 
homeostasis in somatic tissues such as epidermis are not fully defined. 
Transcriptome sequencing of progenitor and differentiating human 
keratinocytes was undertaken using the Illumina paired-end HiSeq 
platform with a read length of 101 nucleotides at about 110 million 
mapped reads per sample, with consistency verified by quantitative 
reverse transciptase PCR (qRT-PCR) (Supplementary Fig. la-c). 
TINCR, an uncharacterized IncRNA°, was among the most highly 
induced IncRNAs of the 258 annotated non-coding RNAs changing 
during differentiation (Fig. la and Supplementary Table 1). The 
TINCR gene resides on chromosome 19 in humans between the 
SAFB2 and ZNRF4 genes (Fig. 1b), a locus conserved with a syntenic 
non-coding region on mouse chromosome 17qD. TINCR produces a 
3.7-kilobase (kb) transcript induced >150-fold during epidermal dif- 
ferentiation (Fig. 1c-e). TINCR is downregulated in human squamous 
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rectangles represent exons. c, Relative TINCR abundance in fragments per 
kilobase of exon model per million mapped fragments (FPKM). d, TINCR 
qRT-PCR. Error bars are s.d., n = 4. e, Northern blot analysis, with TINCR the 
single band seen in differentiation; bp, base pairs. 
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cell carcinoma specimens, consistent with decreased differentiation 
seen in squamous cell carcinomas (Supplementary Fig. 1d). Single- 
molecule RNA fluorescence in situ hybridization (FISH) identified 
80.6% of TINCR molecules newly acquired during differentiation 
within the cytoplasm (Supplementary Fig. le, f). FISH in human epi- 
dermis showed enrichment of TINCR in differentiated layers (Sup- 
plementary Fig. 1g). TINCR is therefore a differentiation-induced, 
predominantly cytoplasmic IncRNA. 

TINCR function was assessed by RNA interference in organotypic 
human epidermal tissue, a setting that recapitulates the structure and 
gene expression of human epidermis”*®. Although TINCR-deficient 
epidermis stratified normally, the expression of key differentiation 
genes mutated in human diseases of abnormal epidermal function”! 
was markedly reduced at the protein (Fig. 2a) and mRNA (Fig. 2b) 
levels. TINCR is thus required for normal induction of key protein 
mediators of epidermal differentiation. 

Transcript profiling of TINCR-depleted epidermis demonstrated 
that TINCR loss disrupted the expression of 394 genes (Supplemen- 
tary Fig. 2a and Supplementary Table 2). TINCR-regulated genes were 
enriched for differentiation-associated epidermal barrier formation- 
related Gene Ontology (GO) terms (Fig. 2c). Barrier formation 
requires genes encoding the protein structure of the terminally differ- 
entiated stratum corneum, such as loricrin and filaggrin, as well as 
those synthesizing specific water-impermeable lipids’*. GO terms 
related to the latter were enriched in genes altered by TINCR loss, as 
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Figure 2 | TINCR regulates epidermal differentiation genes involved in 
barrier formation. a, Loss of differentiation proteins in TINCR-depleted 
organotypic human epidermis by independent TINCR siRNAs (siTINCRA and 
siTINCRB) versus scrambled control (siControl); nuclei stained blue (Hoechst 
33342). Scale bars, 50 jim. b, mRNA expression in TINCR-deficient tissue 
versus control; duplicate biological replicates for duplicate independent TINCR 
siRNAs. Error bars are s.d., n = 6. c, GO terms significantly enriched in the 
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were the mRNA levels of genes in this subset that are genetically non- 
redundant for epidermal barrier formation’*» (Fig. 2d). Furthermore, 
caspase 14, implicated in proteolysis needed for epidermal barrier 
function'’, was diminished by 83.7% with TINCR loss. Protein and 
lipid barrier ultrastructures involved in barrier formation were ab- 
normal in the outer layers of TINCR-deficient epidermis, including 
protein-rich keratohyalin granules (Fig. 2e) and the lipid-rich lamellar 
bodies (Fig. 2f). Deficiencies in these structures are characteristic of 
human genodermatoses with abnormal skin barrier function, includ- 
ing ichthyosis vulgaris and harlequin ichthyosis. No regions of normal 
keratohyalin granule formation were observed in TINCR-deficient 
epidermis, and the number of lamellar bodies in the stratum granu- 
losum of TINCR-deficient human epidermal tissue was reduced by 
81.4%. TINCR is thus required for the induction of genes that form the 
cellular structures that mediate differentiation-associated epidermal 
barrier formation. 

To determine the mechanisms of TINCR action, we developed two 
assays to analyse the TINCR RNA and protein interactome (Sup- 
plementary Fig. 2d). Given its cytoplasmic location (Supplementary 
Fig. le), TINCR control of epidermal barrier genes may occur at 
the post-transcriptional level through direct association with target 
mRNAs. To test this, we developed RNA interactome analysis, fol- 
lowed by deep sequencing (RIA-Seq) (Supplementary Fig. 2d, left). 
Thirty-eight biotinylated DNA probes (Supplementary Table 3) were 
designed in even- and odd-numbered pools (Supplementary Fig. 2e). 
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TINCR-depleted gene subset. d, mRNA expression of lipid barrier synthesis 
genes in TINCR-depleted tissue. Error bars denote s.d., n = 4. e, Loss of 
protein-rich keratohyalin granules (arrows in control) in TINCR-deficient 
organotypic human epidermis. St, stratum. Scale bars, 10 jum. f, Loss of normal 
lipid-containing lamellar bodies (arrows in control, top image) in TINCR- 
depleted tissue (bottom image) (n = 3). Scale bars, 100 nm. 
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These two pools were used separately in a multiplex fashion for pull- 
down of endogenous TINCR and associated RNAs in differentiated 
keratinocytes (Supplementary Fig. 2d, left), similar to recent ap- 
proaches to single-molecule RNA FISH” and chromatin isolation by 
RNA purification’. To discover transcripts enriched by TINCR pull- 
down, a 100-base-pair (bp) sliding window compared the even and 
odd signal to input across the human transcriptome (Supplementary 
Fig. 3a), resulting in the discovery of 3,602 enriched sites. GO analysis 
of TINCR-interacting genes showed enrichment of differentiation- 
associated genes (Fig. 3a). TINCR binding is enriched in mRNAs that 
are downregulated following TINCR knockdown (P = 3.97 X 10’). 
RIA-Seq results were confirmed by RNA interactome analysis and 
qRT-PCR (Supplementary Fig. 3b). These data are consistent with 
potential TINCR action by post-transcriptional differentiation gene 
regulation. 
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Figure 3 | TINCR interacts with differentiation mRNAs and STAU1 
protein. a, Enriched GO terms in TINCR-interacting genes detected by RIA- 
Seq. b, Protein microarray analysis detects TINCR RNA binding to STAU1 
protein. Human recombinant protein microarray spotted with approximately 
9,400 proteins (left); enlarged 144 protein spot subarray (middle) 
demonstrating strand-specific binding of TINCR sense strand to STAU1 
protein (right); DUPD1 protein negative control is shown. Alexa-Fluor-647- 
labelled rabbit anti-mouse IgG in the top left corner of each subarray. c, STAU1 
protein immunoprecipitation pulls down TINCR RNA. ANCR and LINCI (also 
known as XIST) represent IncRNA controls. d, Streptavidin precipitation of in 
vitro synthesized biotinylated TINCR RNA pulls down STAU1 protein. HA, 
haemagglutinin; WB, western blot. 
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LncRNAs can act together with specific proteins**””’. To identify 
TINCR-binding proteins of relevance to epidermal differentiation 
control, we developed human protein microarray analysis (Sup- 
plementary Fig. 2d, right). TINCR sense and antisense RNA were 
transcribed with Cy5 and independently hybridized to a protein 
microarray containing approximately 9,400 recombinant human pro- 
teins (Human ProtoArray). STAU1 protein displayed the strongest 
TINCR RNA binding (Fig. 3b and Supplementary Fig. 3c). Recip- 
rocal binding was confirmed by two-way ribonucleoprotein complex 
pull-down experiments (Fig. 3c, d). STAU1 is a known RNA-binding 
protein” * first identified in Drosophila as a mediator of RNA locali- 
zation in oocytes” ; however, a role for STAU1 in epidermal differenti- 
ation has not been described. Similar to TINCR loss, STAU]1 deficiency 
phenocopied impaired differentiation of epidermal tissue (Fig. 4a and 
Supplementary Fig. 3d). Transcript profiling of STAU1-deficient epi- 
dermis showed significant overlap of STAU1- and TINCR-regulated 
genes (42.5% overlap for siSTAU1 (short interfering RNA (siRNA) 
against STAUI), 47.8% for siTINCR, P= 1.24 10 *””) (Fig. 4b, 
Supplementary Fig. 3e and Supplementary Table 4). Gene set enrich- 
ment analysis (GSEA)** of siSTAU1 as well as siTINCR gene sets 
showed marked overlap with the keratinocyte differentiation signature 
published previously’ (Supplementary Fig. 3f, g) indicating that 
TINCR together with STAU1 is required for epidermal differentiation. 

To study TINCR interaction with differentiation mRNAs, the top 
1,500 TINCR enriched sites detected by RIA-Seq were subjected to a de 
novo motif search. This identified a 25-nucleotide motif that was 
strongly enriched in TINCR-interacting mRNAs and also repeated 
within TINCR itself, termed the TINCR box (Fig. 4c, Supplementary 
Fig. 4 and Supplementary Table 5). A reverse search for the TINCR 
box in all TINCR-enriched sites using the find individual motifs occur- 
rences (FIMO) algorithm yielded sequence similarity (P< 10 X 10 *) 
for 96.3% of the 3,602 sites present. By contrast, a reverse search of the 
TINCR box in the least TINCR-enriched sites showed sequence simi- 
larity (P<1X10~*) for 11.3% of analysed 2,567 sites, indicating 
strong enrichment of the TINCR box motif in TINCR-interacting 
mRNAs. To investigate whether the TINCR box is also relevant for 
interaction with TINCR- and STAU1-regulated mRNAs, TINCR- 
binding transcripts were overlapped with those showing differential 
expression in TINCR- and STAU1-deficient epidermis. Reverse motif 
search with enriched sequences of the resulting 31 genes (Sup- 
plementary Fig. 5a) also showed high occurrence of the TINCR box 
(Fig. 4c). Of note, 3 out of those 31 mRNAs were previously shown to 
interact with STAU]1 protein in a kidney cell line’. Analysis of TINCR 
secondary structure in solution by selective hydroxyl acylation ana- 
lyzed by primer extension (SHAPE)** showed that TINCR boxes of 
three, six, seven and ten reside in open RNA structure conformations 
(Supplementary Fig. 6a, b) that may be amenable to TINCR-target 
interactions. By contrast, analysis of microRNA (miRNA) seed 
sequences showed no enrichment for any miRNA seed in mRNAs 
affected by TINCR knockdown, arguing against a competing endo- 
genous RNA (ceRNA) mechanism for TINCR (Supplementary Tables 
6 and 7). STAU1 depletion did not affect TINCR subcellular locali- 
zation (Supplementary Fig. 5e). To test whether the TINCR motif is 
required for target mRNA binding to TINCR, pull-down experiments 
were performed with a wild-type and TINCR-box-deficient target 
mRNA. Full TINCR binding to the PGLYRP3 differentiation gene 
mRNA occurred with or without STAUI protein, but was dependent 
on the 3’ PGLYRP3 TINCR box (Fig. 4d, e). However, the existence of 
TINCR- and STAU1-regulated differentiation gene mRNAs that don’t 
show direct interaction with TINCR (Supplementary Table 8) suggests 
the potential existence of other mechanisms of indirect target regu- 
lation in this setting. 

Recent work showed that STAU1-binding sites could be created by 
imperfect base pairing between an ALU element of an mRNA target of 
STAU1-mediated decay (SMD) and another ALU sequence in a half- 
STAU1-binding site ncRNA; STAU]1 binding to this sequence leads to 


10 JANUARY 2013 | VOL 493 | NATURE | 233 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


a b c 


siSTAU1 


1.0 


E value: 


RlA-enriched 
region motif (sense) 


RlA-enriched 
region motif (antisense) 


= 
5 08 9.3 x 10-203 |* 
2 
2 06 
is TINCR &%——}@ $reg $$$ 
oO 
5 oe ALOX12B gg 
$ 02 KRT1 Opp 
‘a an PGLYRP3 D___ gs 
; KRTeo @ 
SGPP2 B—___ 
siTINCR KAT7e 
*P = 1.24 x 10222 
PRSS27 De 


TMM foci: 


CTCCCTCCCCTIGCTGGGTCTCTCCA 


Figure 4 | Differentiation regulation by TINCR RNA and STAU1 protein. 
a, Diminished expression of TINCR-regulated genes in STAU1-depleted 
organotypic tissue in independent biological replicates for duplicate 
independent STAU1 siRNA treatments. Error bars are s.d., n = 3. b, Overlap of 
672 genes regulated by TINCR and STAUI ¢, Location of the TINCR box motif 
in TINCR as well as selected TINCR-associated differentiation mRNAs. RIA, 


mRNA degradation in a UPF1/2-dependent manner”. The lack of 
change in TINCR transcript levels in STAU1-depleted epidermis 
(Supplementary Fig. 5b) and the direct binding of TINCR to STAU1 
without other RNAs (Fig. 3b-d and Supplementary Fig. 3c) indicate 
that TINCR is neither a direct degradation target of STAU1 nora half- 
STAU1-binding site RNA. Moreover, a reverse search found that only 
142 of the 3,602 enriched TINCR target sites contain an ALU element, 
28 of which showed enrichment for the TINCR motif (P < 10 X 10 *), 
suggesting that most TINCR-mRNA interactions occur indepen- 
dently of ALU elements. To test whether TINCR-STAU1 acts by 
UPF1/2-dependent SMD, we generated UPF1-deficient as well as 
UPF1/2 double-deficient human epidermal tissue. UPF depletion 
failed to alter epidermal differentiation substantially (Supplementary 
Fig. 5c, d), indicating that siTINCR differentiation defects are not due 
to disrupting SMD and thus occur through a previously uncharacter- 
ized mechanism. To explore this, differentiation mRNA stability was 
assessed as a function of TINCR and STAU1 for the two TINCR box 
motif-containing differentiation mRNA, KRT80, in differentiating 
keratinocytes using actinomycin D. Although KRT80 mRNA de- 
creased by 24.6% after 2h in control differentiating keratinocytes, in 
siTINCR/siSTAU1 double-deficient keratinocytes it decreased by 
91.3% (Supplementary Fig. 7), consistent with differentiation mRNA 
stabilization by TINCR and STAU1. 

Among IncRNAs regulators*”’*°, TINCR acts in somatic differenti- 
ation post-development. TINCR may control differentiation mRNA 
abundance post-transcriptionally, as indicated by TINCR cytoplasmic 
localization, TINCR binding to differentiation mRNAs and the effect 
of TINCR on differentiation mRNA stability. Binding of TINCR to 
interacting mRNAs occurs through the TINCR box motif. Loss of the 
TINCR-associated cytoplasmic protein STAU1 resembles TINCR 
loss and demonstrates a new UPF1/2-independent role for STAU1 
in differentiation. 
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RNA interactome analysis. d, TINCR motif base-pairing between the TINCR 
transcript and PGLYRP3 differentiation gene mRNA. e, Biotinylated TINCR 
RNA with or without STAU1 protein pulls down the full-length 1,077-bp 
PGLYRP3 mRNA at higher efficiency than TINCR box-depleted PGLYRP3 
(PGLYRP3A96 bp). Error bars denote s.d. and are too small to be visible for the 
PGLYRP3A96 samples; n = 3. 


METHODS SUMMARY 


Primary human keratinocytes were isolated from freshly discarded surgical skin 
samples and grown in a 1:1 mixture of KSF-M (Gibco) and Medium 154 for 
keratinocytes (Gibco), supplemented with epidermal growth factor and bovine 
pituitary extract. Cells were cultured at 37°C in a humidified chamber with 
5% CO. Keratinocyte differentiation was induced in vitro by the addition of 
1.2mM calcium to the media, then cells were grown in full confluence. 

For siRNA-mediated gene transfer, 1 X 10° primary human keratinocytes were 
electroporated with 1 nmol siRNA oligonucleotide, using the human keratinocyte 
Nucleofector kit (Lonza). 

Immunofluorescence analysis was performed with 7-1m-thick skin sections 
from human organotypic skin cultures, fixed in 100% methanol or acetone for 
10 min followed by blocking in PBS with 10% bovine serum albumin (BSA) for 
30 min. Sections were incubated with primary antibodies for 1h. Antibodies were 
diluted in PBS with 1% BSA. For histological analysis, human organotypic epi- 
dermal tissue was fixed in 10% formalin (Sigma-Aldrich), embedded in paraffin, 
sectioned and stained with haematoxylin and eosin. 

qRT-PCR was performed using the Maxima SYBR Green qPCR master mix 
(2X, Fermentas) and the Mx3000P (Agilent) thermocycler. Samples were run in 
triplicate and normalized to 18S ribosomal RNA or ribosomal protein L32 mRNA. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Tissue culture. Primary human keratinocytes were isolated from freshly dis- 
carded skin surgical samples and grown in a 1:1 mixture of KSF-M (Gibco) and 
Medium 154 for keratinocytes (Gibco), supplemented with epidermal growth 
factor and bovine pituitary extract. Cells were cultured at 37 °C in a humidified 
chamber with 5% CO). Keratinocyte differentiation was induced in vitro by the 
addition of 1.2 mM calcium to the media, then cells were grown in full confluence 
for up to 8 days. 

Gene transfer. All siRNA oligonucleotide duplexes used in this work were syn- 
thesized by Dharmacon. One-million primary human keratinocytes were electro- 
porated with 1 nmol siRNA oligonucleotide, using the Amaxa human keratinocyte 
Nucleofector kit (Lonza) as well as Amaxa Nucleofection reagents as described 
previously*. The following siRNA oligonucleotides were used for this work: 
siControl (sense sequence): 5’-GUAGAUUCAUAUUGUAAGGUU-3’; siTINCRA 
(sense sequence): 5’-GCAUGAAGUAGCAGGUAUUUU-3’; siTINCRB (sense 
sequence): 5'-GAUCCCGAGUGAGUCAGAAUU-3’; siSTAUIA (target sequence): 
5'-GCAGGGAGUUUGUGAUGCA-3’; siSTAUIB (target sequence): 5’-CGAGU 
AAAGCCUAGAAUCA-3’; siUPFIA (target sequence): 5’-CAGCGGAUCGUGU 
GAAGAA-3'; siUPF1B (target sequence): 5'-CAAGGUCCCUGAUAAUUAUV-3’. 
Organotypic human epidermal tissue. For the generation of organotypic human 
epidermis, primary human keratinocytes were nucleofected with siRNA oligonu- 
cleotides and cultured for 12-36 h. Four-hundred-thousand nucleofected kerati- 
nocytes were seeded onto devitalized dermis and raised to the air-liquid interface 
to initiate stratification and differentiation of the epidermis culture as described 
previously’*. 

Immunofluorescence and tissue analysis. Seven-micrometre-thick skin sections 
from human organotypic skin cultures were fixed in 100% methanol or acetone 
for 10 min followed by blocking in PBS with 10% bovine serum albumin (BSA) 
for 30 min. Sections were incubated with primary antibodies for 1h. Primary 
antibodies were diluted in PBS with 1% BSA and include collagen type VII 
(Calbiochem, 234192 for rabbit; Millipore, MAB2500 for mouse) at 1:200 dilu- 
tion, filaggrin (Biomedical Technologies, BT-576) at 1:500 dilution, keratin 1 
(Covance, PRB-149P) at 1:2,000 dilution, keratin 10 (Neomarkers, MS-611-P) 
at 1:200 dilution, loricrin (Covance, PRB-145P) at 1:200 dilution, and transglu- 
taminasel (Biomedical Technology, BT-621) at 1:100 dilution. The secondary 
antibodies used were Alexa-555-conjugated goat anti-mouse and goat anti-rabbit 
IgG (Molecular Probes, 1:300 dilution), and Alexa-488-conjugated goat anti- 
rabbit and goat anti-mouse IgG (Molecular Probes, 1:300 dilution). The nuclear 
dye Hoechst 33342 (Molecular Probes) was used at 1:1,000 dilution. For histolo- 
gical analysis, human organotypic epidermal tissue was fixed in 10% formalin 
(Sigma-Aldrich), embedded in paraffin, sectioned and stained with haematoxylin 
and eosin. 

Transmission electron microscopy. Organotypic human epidermal tissue was 
fixed in 2% paraformaldehyde, 2% glutaraldehyde, 0.1M cacodylate buffer, 
pH7.3, 0.06% CaCl), post-fixed with ruthenium tetroxide (Polysciences) for 
45 min and stained for 15 min in 1:1 saturated uranyl acetate. After three washes 
in water, samples were dehydrated and infiltrated with EMbed-812 resin (EMS) 
mixed 1:1 with propylene oxide for 2h followed by two parts EMbed-812 to one 
part propylene oxide overnight. The samples were then placed into EMbed-812 for 
2-4h, transferred into moulds with fresh resin, oriented and incubated at 65 °C 
overnight. Ultrathin sections (80 nm) were picked up on formvar/carbon-coated 
slot copper grids and examined with the JEOL JEM-1400 TEM at 80 kV. Photos 
were taken using a Gatan Orius digital camera. 

qRT-PCR analysis. Total RNA from cells was isolated with Trizol reagent 
(Invitrogen) and genomic DNA removed using the TurboDNase kit (Ambion) 
and quantified by NanoDrop. Total RNA from organotypic skin cultures was 
isolated with the RNeasy Plus mini kit (Qiagen) according to the manufacturer’s 
instructions. One-microgram of total RNA was reverse transcribed with the iScript 
cDNA synthesis kit (Biorad). RT-PCR was performed using the Maxima SYBR 
Green qPCR master mix (2, Fermentas) and the Stratagene Mx3000P (Agilent 
Technologies) thermocycler. Samples were run in triplicate and normalized to 
18S rRNA, GAPDH or ribosomal protein L32. The following primer sequences 
were used: 18S forward: 5’-GCAATTATTCCCCATGAACG-3’, reverse: 5’-GG 
CCTCACTAAACCATCCAA-3’; L32 forward: 5'-AGGCATTGACAACAGG 
GTTC-3’, reverse: 5’-GTTGCACATCAGCAGCACTT-3’; GAPDH forward: 5'- 
GAAGAGAGAGACCCTCACTGCTG-3’, reverse: 5’-ACTGTGAGGAGGGG 
AGATTCAGT-3'; FLG forward: 5'-AAAGAGCTGAAGGAACTTCTGG-3’, 
reverse: 5’-AACCATATCTGGGTCATCTGG-3’; KRT1 forward: 5'-TGAGCTG 
AATCGTGTGATCC-3’, reverse: 5’-CCAGGTCATTCAGCTTGTTC-3’; KRT10 
forward: 5'-GCAAATTGAGAGCCTGACTG-3’, reverse: 5'-CAGTGGACACA 
TTTCGAAGG-3’; LOR forward: 5'-CTCTGTCTGCGGCTACTCTG-3’, reverse: 
5'-CACGAGGTCTGAGTGACCTG-3’; STAUI forward: 5’-ATGGTATCGGC 


AAGGATGTG-3’, reverse: 5'-AGACATTGGTCCGTTTCCTG-3'; ALOX12B 
forward: 5'-AGACTGCAATTCCGGATCAC-3’, reverse: 5’-TGTGGAATGCA 
CTGGAGAAG-3’; ALOXE3 forward: 5'-GAGCAAAAATCTCGCCAGTC-3’, 
reverse: 5'-GGGCTTTGTCTCAGAAATCG-3’; ABCA12 forward: 5'-AACAG 
TCCAAAGCCATCCAG-3’, reverse: 5'-GAGCAGCAGCAATTTCACAG-3’; 
CASP14 forward: 5'-TTCCGAAGAAGACCTGGATG-3’, reverse: 5’-TGG 
GGTCTCTTTTCATGGTG-3’; ELOVL3 forward: 5’-TTCGAGGAGTATTG 
GGCAAC-3’, reverse: 5'-GAAGATTGCAAGGCAGAAGG-3’; TINCR forward: 
5'-TGTGGCCCAAACTCAGGGATACAT-3’, reverse: 5’-AGATGACAGTGGC 
TGGAGTTGTCA-3’; ANCR forward: 5’-GCCACTATGTAGCGGGTTTC-3’, 
reverse: 5'-ACCTGCGCTAAGAACTGAGG-3’; LINC1 forward: 5'-TTCTGGA 
TGCAGCCACACTTCACA-3’, reverse: 5’-TGCCAGAGGAATTCTGTTTT-3’; 
UPFI forward: 5’-ATATGCCTGCGGTACAAAGG-3’, reverse: 5’-AGCTCAAT 
GGCGATCTCATC-3'; HIST1H2BG forward: 5'-ACAAGCGCTCGACCATTA 
CCT-3’, reverse: 5’-TGGTGACAGCCTTGGTACCTTC-3’; PGLYRP3 forward: 
5'-GCCAGGCAGTCTCATTTACC-3’, reverse: 5’-AGAGAAGCCAGCATCAC 
CTC-3'; PRSS27 forward: 5’-AGTTCATGCCCGTCTCAAAG-3’, reverse: 5’-GC 
CCTTCACCAATTACATCCT-3’. 

Northern blot analysis. Total RNA from keratinocytes was isolated with Trizol 
reagent (Invitrogen). A TINCR-specific, radioactive DNA probe with a length of 
581 bp was generated using [o-*?P]dCTP (Perkin Elmer) and the Megaprime 
DNA labelling system (GE Healthcare). Hybridization was performed using 
QuickHyb (Agilent), following the manufacturer’s instructions. 
Single-molecule RNA FISH. Keratinocytes were grown on chambered cover 
glasses (Lab-Tek) and fixed in 3.7% glutaraldehyde in PBS. Forty fluorophore- 
linked antisense DNA probes targeting the TINCR full-length sequence were 
designed using the online designer at http://www.singlemoleculefish.com. 
Probes mapping to homologous or repeat sequences were excluded. In situ hybrid- 
ization and imaging was performed as described previously’’. 

Full transcriptome RNA sequencing. RNA sequencing libraries were prepared 
with the mRNA Seq sample prep kit (Illumina), following the manufacturer’s 
instructions. In brief, total RNA was isolated from differentiated (days 3 and 6) 
and progenitor human keratinocyte populations, and mRNA was extracted by 
polyA selection from 1-2 ug total RNA per sample. PolyA-selected RNA was 
fragmented and randomly primed for reverse transcription, followed by second- 
strand complementary DNA synthesis, end repair, adenylation of 3’ ends, adaptor 
ligation and PCR amplification of approximately 300-bp cDNA fragments. High- 
throughput full transcriptome sequencing was undertaken using the Illumina 
paired-end HiSeq platform with a read length of 101 nucleotides. Reads were 
aligned to the human reference sequence NCBI Build 36.1/hg18 with the 
TopHat algorithm”, resulting in between 100 million and 110 million mapped 
reads per sample. Differential expression analysis was performed with Cuffdiff, 
using human RefSeq transcripts as a reference transcriptome. Annotated RefSeq 
non-coding RNAs expressed during calcium-induced differentiation at an 
FPKM > 5 in at least one time point were selected based on greater than twofold 
change with false discovery rate (FDR)-adjusted P<0.05 and included in the 
cluster heat map. Non-coding RNAs that overlapped with protein-coding genes 
were excluded. 

Gene expression profiling. Microarray analysis was performed on biological 
duplicate samples. Labelling of cDNA derived from control and TINCR-depleted 
human organotypic epidermis was done using the MessageAmp III] RNA amp- 
lification kit (Ambion), following the manufacturer’s instructions. Hybridization 
to Affymetrix GeneChip HG-U133 Plus 2.0 arrays was carried out at Stanford’s 
Protein and Nucleic Acid Facility. Gene expression analysis was performed as 
described previously’. 

RNA interactome analysis. Antisense DNA probes were designed target- 
ing the TINCR full-length sequence using the online designer at http://www. 
singlemoleculefish.com. Probes mapping to homologous or repeat sequences were 
excluded. Thirty-eight probes were generated and split into two sets based on their 
relative positions along the TINCR sequence such that even-numbered and odd- 
numbered probes were separately pooled, similar in design to other RNA-binding 
approaches taken successfully in single-molecule RNA FISH". All probes were 
biotinylated at the 3’ end with an 18-carbon spacer arm (Protein and Nucleic Acid 
Facility, Stanford University). Cells were grown in differentiation conditions and 
rinsed once with room-temperature PBS, followed by fixation with 1% glutaral- 
dehyde in PBS for 10 min at room temperature. Crosslinking was then quenched 
with 0.125 M glycine for 5 min. Cells were rinsed again with PBS, scraped into 
Falcon tubes, and pelleted at 1,500g. Cell pellets were snap frozen in liquid nitrogen 
and could be stored at —80 °C. To prepare lysates, cell pellets were quickly thawed 
in a 37°C water bath and resuspended in cell lysis buffer (50mM Tris, pH7.0, 
10mM EDTA, 1% SDS, and added just before use: dithithreitol (DTT), phenyl- 
methylsulphonyl fluoride (PMSF), protease inhibitor and Superase-In) at 
100 mg ml! on ice for 10 min, and sonicated using Bioruptor (Diagenode) until 
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lysates were completely solubilized. RNA was in the size range of 100 to 
500 nucleotides. Cell lysate was diluted in double the volume of hybridization 
buffer (500 mM NaCl, 1% SDS, 100mM Tris, pH7.0, 10mM EDTA, 15% for- 
mamide, and added just before use: DIT, PMSF, protease inhibitor, and Superase- 
In. Probes (100 pmol) were added to 3 ml of diluted lysate, which was mixed by 
end-to-end rotation at 37°C overnight. Streptavidin-magnetic C1 beads were 
washed three times in cell lysis buffer, 1 mg (100 jl) of beads was added to hybridi- 
zation reaction per 100 pmol of probes, and the whole reaction was mixed for 
30 min at 37 °C. Beads-biotin-probes-RNA adducts were captured by magnets 
(Invitrogen) and washed five times with a wash buffer volume equivalent to ten 
times the volume of the beads (2XSSC, 0.5% SDS, fresh PMSF added). After the 
last wash, buffer was removed carefully. For RNA elution, beads were resuspended 
in 200p1 RNA proteinase K buffer (100mM NaCl, 10mM Tris, pH 7.0, 
1mM EDTA, 0.5% SDS) and 1 mg ml | proteinase K (Ambion). After incubation 
at 50 °C for 45 min, followed by boiling for 10 min, RNA was isolated using Trizol 
reagent. Eluted RNA was subject to DNase treatment (TURBO DNase kit, 
Ambion) followed by qRT-PCR for the detection of enriched transcripts. For 
TINCR box removal, full-length PGLYRP3 (1,077-bp length) was cut with the 
restriction endonuclease CspCl, resulting in loss of a 96-bp fragment at the 3’ end 
of the transcript. LacZ RNA was added as control. qRT-PCR data was normalized 
to TINCR and LacZ. For high-throughput sequencing, RNA was first converted 
into cDNA using the Ovation V2 kit (Nugen). cDNA samples were then fragmen- 
ted to an average of 130 bp by Covaris sonicator, and 200 ng of shortened cDNA 
samples was converted into libraries compatible with Illumina Genome Analyzer 
IIx using the NEBNext ChIP-seq Library prep set (NEB). Approximately 22 million 
to 32 million reads were mapped per sample and used for downstream analysis. 
RIA-Seq data analysis. Sequencing reads were mapped to NCBI build 36.1/hg18 
using TopHat” with default options. Search for enriched peaks in the ‘even’ and 
‘odd’ samples compared to input control was performed by scanning each gene 
using 100-nucleotide sliding windows. The number of reads overlapping with each 
sliding window was calculated using the ‘coverageBed’ function from BEDTools 
for even, odd, and input samples”. Signals in even, odd and input samples were 
defined as normalized read counts to the total number of aligned reads in each 
sample. Only sliding windows that have at least one read in every sample were 
included for downstream analysis. Enrichment score of each sliding window was 
defined as: 


min (even_signal; ,odd_signal;) 


EScore; 


input_signal; 


The average log>(EScore) and standard deviation of log,(EScore) were also calcu- 
lated and noted here as mean and s.d. A sliding window was defined as an enriched 
site if its log,(EScore) > mean + 1.5s.d. We discovered 3,602 enriched sliding 
windows, which cover 1,852 RefSeq genes (defined to be enriched genes). Enriched 
sliding windows were ranked by EScore and sequences of the top 1,500 sliding 
windows were extracted and used for de novo motif search in MEME”. Sequences 
of genes that were enriched in RIA-Seq and also differentially expressed in 
siTINCR and siSTAU (significance analysis of microarrays FDR<0.05 and 
greater than twofold change in signal intensity in siControl versus siTINCR or 
siSTAUl1 samples) were also extracted for reverse motif search using the FIMO 
algorithm’’. GO terms associated with enriched genes were calculated using 
DAVID™. 

ProtoArray hybridization. The sense and antisense TINCR RNAs were in vitro 
transcribed from the pBluescriptR-TINCR vector (Open Biosystems) using T7 
and T3 polymerases (Promega) in accordance with the manufacturer’s instruc- 
tions. RNA labelling for microarray hybridizations was performed using the Label 
IT pArray Cy5 labelling kit (Mirus) to achieve the labelling efficiency of roughly 
3pmol Cy5 dye per microgram of RNA. Human Protein Microarrays v5.0 
(Invitrogen) were independently hybridized with 10 pmol sense and antisense 
strands of TINCR RNA in 40 mM Tris-HCl, pH 8.0, 150mM sodium chloride, 
0.5mM magnesium acetate, 10 pg ml! Yeast transfer RNA, 10 ug ml heparin, 
1mM DTT, 0.01% Igepal CA-630, 5% glycerol and 0.2U ul’ RNaseOQUT 
(Invitrogen), incubated in the dark at 25°C for 1h and after extensive washes 
spin dried and scanned at 635nm (Cy5) using the GenePix 4000B Microarray 
scanner (Molecular Devices). The intensity of the 635-nm signal at each spotted 
protein location was determined with GenePix Pro 6.1 software (Molecular 
Devices). To quantify RNA-protein interactions, the intensity of the 635-nm 
signal (F635) was divided by the local background intensity (B635) at each of 
the duplicate spots for a given protein. Data were filtered based on signal above 
background, including duplicate features with mean signal intensities greater than 
twofold above local background into the analysis. For the visualization process, the 
array images from antisense RNA hybridizations were pseudocoloured green and 
overlaid with the sense RNA hybridization images. 
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Protein immunoprecipitation with subsequent RNA detection. For in vitro 
protein-RNA complex immunoprecipitation, pcDNA3.1Hygro-STAUI1-HA 
plasmid containing carboxy terminal 3 HA-tagged human staufen1 was in vitro 
translated using the rabbit reticulocyte lysate system (Promega) according to the 
manufacturer’s instructions. The sense strands of full-length human non-coding 
RNAs (TINCR, LINC1 and ANCR) were in vitro transcribed using T7 promoter. 
Two-hundred-and-fifty nanograms of TINCR, LINC1 (325 nucleotides; chr1q21.3) 
or ANCR (855 nucelotides; chr4q12) RNA were combined with 12.5 ul in vitro 
translated STAU1-HA in immunoprecipitation buffer containing 40mM Tris- 
HCl, pH 8.0, 150mM sodium chloride, 0.5 mM magnesium acetate, 20 pg ml! 
heparin, 1mM DTT, 0.01% or 1% Igepal CA-630, 5% glycerol, 0.2U ult 
RNaseOUT (Invitrogen) and protease inhibitor Complete Mini (Roche). After 
1h of incubation STAU1-HA was immunoprecipitated with 201 Protein G 
Dynabeads (Invitrogen), bound to 2.5yg anti-HA antibody (HA.11, clone 
16B12, Covance), washed extensively with immunoprecipitation buffer, and 
RNA from the residual mixture was extracted with phenol-chloroform followed 
by ethanol precipitation. The cDNA was then synthesized with iScript cDNA 
synthesis kit (Bio-Rad) and subjected to qRT-PCR. 

In vitro RNA pull-down with subsequent protein detection. For in vitro RNA 
pull-down, TINCR, LacZ and ANCR mRNAs were labelled with biotin-16-UTP 
(GE Healthcare) during in vitro transcription reaction according to the manu- 
facturer’s instructions. Five microlitres of in vitro translated STAUI-HA was 
incubated with 1 jg biotin-16-UTP-labelled TINCR, LacZ or ANCR in immuno- 
precipitation buffer for 30min at 25°C. After addition of 5ul of MyOne 
Streptavidin T1 Dynabeads (Invitrogen), the mixture was incubated for a further 
30 min and subjected to five wash cycles of 5 min, each using 500 pil IPB buffer. 
After the final wash, magnetic beads were resuspended in 12 ll protein-loading 
buffer, RNA-bound protein separated by SDS-PAGE and detected with anti-HA 
monoclonal antibody (Covance, MMS-101P) by western blot analysis. Western 
blot analysis was performed as described previously’. 

RNA stability assay. For analysis of RNA stability, keratinocytes were treated with 
actinomycin D (6 1g ml!) at day 3 of differentiation. Cells were collected at 0, 4, 
12 and 20h time points and RNA extracted using Trizol reagent (Invitrogen). 
Reverse transcription was performed using oligo(dT) primers and mRNA levels 
were measured by qRT-PCR. 

Characterization of TINCR motif box secondary structure. For the acylation of 
TINCR in a typical in vitro modification protocol, 6 jig total RNA was heated in 
metal-free water for 2 min at 95°C. The RNA was then flash-cooled on ice. The 
RNA 3X SHAPE buffer (333 mM HEPES, pH 8.0, 20mM magnesium chloride, 
333 mM sodium chloride) was added and the RNA was allowed to equilibrate at 
37 °C for 10 min. To this mixture, 1 tl of 10X electrophile (130 mM, NMIA) stock, 
with or without dimethylsulphoxide, was added. The reaction was permitted to 
continue until the desired time. Reactions were extracted once with acid phenol- 
chloroform (pH 4.5 + 0.2) and twice with chloroform. RNA was precipitated with 
40 ul of 3M sodium acetate buffer, pH5.2, and 1 pl of glycogen (20 pg pl’). 
Pellets were washed twice with 70% ethanol and resuspended in 10 pl RNase-free 
water. For reverse transcription of modified RNA, [*’P]-end-labelled DNA pri- 
mers were annealed to 3 jig of total RNA by incubating at 95 °C for 2 min followed 
by a step-down cooling (2°Cs_') to4°C. To the reaction first-strand buffer, DTT 
and deoxynucleotides were added. The reaction was preincubated at 52°C for 
1 min, then superscript III (2 U pl! final concentration) was added. Extensions 
were performed for 10 min. To the reaction, 1 pil of 4M sodium hydroxide was 
added and allowed to react for 5min. Ten microlitres of gel loading buffer II 
(Ambion) was then added and cDNA extensions were resolved on 8% denaturing 
(7M urea) polyacrylamide gels (29:1 acrylamide:bisacrylamide, 1% TBE). 

For the characterization of reverse transcription stops, cDNA extensions were 
visualized by phosphorimaging (STORM, Molecular Dynamics). cDNA bands 
were integrated with SAFA*. SHAPE reactivities were normalized to a scale span- 
ning 0 to 1.5, in which 1.0 is defined as the mean intensity of highly reactive 
nucleotides**. RNA secondary structures were predicted using mFOLD software’’. 
For secondary structure analysis, the following TINCR radiolabelled RT primers 
were used: (1) 5’-CCTTGATGTGGTAGCGCTTCCAGCGC-3’; (2) 5’-AGACA 
GGGCACCCAGGGCCCCAAGAGG-3’; (3) 5'-TGTCCTGGGCAAGAGCGGA 
AGTGCCTC-3’; (4) 5'-AAAGCAGCTCCAGCAGGTCTGCCTGCCG-3’; (5) 
5'-GGGCACACAGTGGGTCTCTGGGGACAAAG-3’; (6) 5’-TACTGTTTGTT 
AAATGTCAAAACACCCTG-3’; (7) 5’-TTTCCTCGGTGTGGCTGTGGGACC 
TTAGG-3’; (8) 5’-AAGCCTATCAGGCCTGGAGCTTCCTCAAAG-3’; (9) 5’- 
GAAGTAGCAGGTATTGAAGCTAGG-3’; and (10) 5’-TGCTGGGAGGAGA 
CACAGACCTCC-3’. 
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COUP-TFII inhibits TGF-B-induced growth barrier 
to promote prostate tumorigenesis 


Jun Qin', San-Pin Wu!, Chad J. Creighton’, Fangyan Dai, Xin Xie!, Chiang-Min Cheng’, Anna Frolov*, Gustavo Ayala*+, Xia Lin’, 
Xin-Hua Feng’, Michael M. Ittmann’*, Shaw-Jenq Tsaib?, Ming-Jer Tsaib® & Sophia Y. Tsai>® 


Mutations in phosphatase and tensin homologue (PTEN) or geno- 
mic alterations in the phosphatidylinositol-3-OH kinase-signalling 
pathway are the most common genetic alterations reported in 
human prostate cancer’ *. However, the precise mechanism under- 
lying how indolent tumours with PTEN alterations acquire meta- 
static potential remains poorly understood. Recent studies suggest 
that upregulation of transforming growth factor (TGF)-B signal- 
ling triggered by PTEN loss will form a growth barrier as a defence 
mechanism to constrain prostate cancer progression’, underscoring 
that TGF-B signalling might represent a pre-invasive checkpoint to 
prevent PTEN-mediated prostate tumorigenesis. Here we show 
that COUP transcription factor II (COUP-TFI, also known as 
NR2EF2)*°, a member of the nuclear receptor superfamily, serves 
as a key regulator to inhibit SMAD4-dependent transcription, 
and consequently overrides the TGF-B-dependent checkpoint for 
PTEN-null indolent tumours. Overexpression of COUP-TFII in the 
mouse prostate epithelium cooperates with PTEN deletion to aug- 
ment malignant progression and produce an aggressive metastasis- 
prone tumour. The functional counteraction between COUP-TFII 
and SMAD4 is reinforced by genetically engineered mouse models 
in which conditional loss of SMAD4 diminishes the inhibitory 
effects elicited by COUP-TFII ablation. The biological significance 
of COUP-TFII in prostate carcinogenesis is substantiated by patient 
sample analysis, in which COUP-TFII expression or activity is 
tightly correlated with tumour recurrence and disease progression, 
whereas it is inversely associated with TGF-B signalling. These 
findings reveal that the destruction of the TGF-B-dependent barrier 
by COUP-TFII is crucial for the progression of PTEN-mutant pro- 
state cancer into a life-threatening disease, and supports COUP- 
TFII as a potential drug target for the intervention of metastatic 
human prostate cancer. 

COUP-TFII is known to promote tumour angiogenesis through 
its function in the tumour microenvironment!™!', but its role in 
tumour cells remains undefined. Consistent with aberrant expression 
of COUP-TFII reported in various tumours’*-'”, Oncomine expression 
analysis revealed higher COUP-TFII expression in prostate tumour 
cells and in metastatic prostate cancer than in primary prostate can- 
cer’*! (Supplementary Fig. 1a). To assess the clinical relevance of 
COUP-TFI in human prostate cancer, we stained for COUP-TFII 
in a tumour tissue microarray (TMA) consisting of 407 patient speci- 
mens. Examination of clinical prostate specimens showed higher levels 
of COUP-TFII expression in prostate tumour cells in comparison to 
the adjacent normal prostate epithelium (Supplementary Fig. 1b). As 
summarized in Fig. 1a, approximately 60% of human prostate cancer 
specimens exhibited intermediate to intense nuclear COUP-TFII 
staining, whereas only 5% of normal prostate epithelial cells stained 
positive for COUP-TFIL. Further correlation studies indicated that 
COUP-TFII expression in tumour cells significantly associated with 


pathological predictors of human prostate cancer aggressiveness (Sup- 
plementary Table 1). Most importantly, univariate analysis indicated 
that COUP-TFI expression levels in prostate tumour cells could serve 
as a predictor to stratify risk of recurrence in patients; patients bearing 
higher levels of COUP-TFII expression showed earlier recurrence 
after radical prostatectomy (Fig. 1b). Cox proportional hazards 
regression analyses showed that tumour recurrence was sensitive to 
gradual increases of COUP-TFII expression levels, as reflected by 
the increasing hazard ratios, from 1.5 to 1.8 and 2.5. 

These results prompted us to use genetically engineered mouse mo- 
dels to determine the causal role of COUP-TFII in prostate tumori- 
genesis. As such, we engineered mice containing PB-Cre (the Cre gene 
under control of the prostate-specific probasin promoter)'® and con- 
ditional knockout alleles of Pten? and/or COUP-TFII (also known as 
Nr2f2)’, hereafter referred to as Pten®~'~ and Pten?°~/~; COUP- 
TFII’°~’~ mice, respectively (Supplementary Fig. 2a, b). PTEN-null 
mice lacking COUP-TFII exhibited a significant reduction in prostate 
weight (Supplementary Fig. 2c), suggesting a positive role for COUP- 
TFII in driving prostate tumorigenesis. Histopathological analysis of 
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Figure 1 | COUP-TFII is crucial for prostate cancer progression in human 
and mice. a, Expression and clinical relevance of COUP-TFII are assessed in a 
human prostate cancer TMA. COUP-TFII staining indexes in cohorts of 
normal prostate epithelial cells (n = 407) and prostate tumours (n = 416) 

are shown as stacked columns. b, Kaplan-Meier plot of the recurrence after 
radical prostatectomy based on the COUP-TFII expression index in patients. 
HR, hazard ratio. c, Haematoxylin and eosin (H&E) staining of the anterior 
prostate from Pten?“~ and Pten’© ’; COUP-TFIF’ © “~ mice at 5 months of 
age. Scale bar, 50 um. d, Pie graphs show prostate tumour progressions in 
Pten’©/~ and Pten’°~’-; COUP-TFIP’° /~ mice (5 months old; n = 10). 
HGPIN, high-grade PIN; LGPIN, low-grade PIN. 
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the entire animal cohort showed that loss of COUP-TFII com- 
promised prostate tumour progression, as reflected by the arrest of 
tumours at the hyperplastic or low-grade prostatic intraepithelial 
neoplasia (PIN) stage at 5months of age, whereas tumours from 
Pten®©~’~ mice already advanced to high-grade PIN or adenocar- 
cinoma (Fig. lc, d and Supplementary Fig. 2d). The normal pro- 
static histology in COUP-TFII’© ’~ mice (Supplementary Fig. 2e) 
argued against a possibility that it was due to a developmental 
defect. Moreover, loss of COUP-TFII attenuated the proliferation 
advantage elicited by PTEN loss, whereas the levels of angiogenesis 
remained similar and thus is unlikely to have contributed to the 
restriction of prostate tumorigenesis (Supplementary Fig. 2f, g). 

Given that COUP-TFII was further upregulated in metastatic pro- 
state cancer, and its higher expression correlated with a worse clinical 
outcome, we sought to determine whether overexpression of COUP- 
TFI in PTEN-null tumours would exacerbate prostate tumorige- 
nesis. As such, we generated an inducible COUP-TFII overexpression 
mouse, COUP-TFIO”’* (Fig. 2a), that allows constitutive COUP-TFII 
expression in the prostate epithelium. During 12 months of follow-up, 
COUP-TFII*”* mice exhibited normal prostatic histology (Supplemen- 
tary Fig. 3a), indicating that COUP-TFI by itself is not sufficient to ini- 
tiate prostate neoplasia, and that cooperating oncogenic lesions are 
required. Indeed, overexpression of COUP-TFII in a PTEN-heterozyg- 
ous background resulted in a rapid acceleration of tumour progression, 
with overt and high penetrance PIN starting at 4months of age and 
progressing to high-grade PIN or adenocarcinoma by 12 months 
(Fig. 2b). These data demonstrate that COUP-TFII enables rapid disease 
progression from preneoplastic prostatic epithelium. 

Next, we determined whether prostate-specific overexpression of 
COUP-TFII in a PTEN-null background would produce a metasta- 
sis-prone cancer. By 24 weeks of age, Pten®“~’~; COUP-TFII”* mice 
developed highly aggressive carcinoma with an altered stroma, as evi- 
denced by the attenuation or loss of the periglandular smooth muscle 
layer and androgen receptor (AR)-positive tumour cells invading into 
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the stroma (Fig. 2c), in which subsets of invasive tumour cells exhibited 
epithelial-mesenchymal transition (Supplementary Fig. 3c). Molecu- 
lar pathological analysis of prostate-cancer-bearing Pten’© ’~ ; COUP- 
TFC" mice also showed a metastatic spread of AR-positive tumour 
nodules to lumbar lymph nodes in 13 out of 16 cases, and lung meta- 
stasis in 4 out of 16 cases (Fig. 2d). By contrast, Pten’“~/~ mice only 
showed a modest metastatic phenotype” *. Even with more aggressive 
tumour phenotypes, angiogenesis was not altered whereas prolifera- 
tion was enhanced after COUP-TFII overexpression (Fig. 2e and 
Supplementary Figs 3d, e and 4). Collectively, COUP-TFII overex- 
pression enhances the full spectrum of prostate cancer development, 
leading to aggressive metastasis in PTEN-null tumours. 

To determine how COUP-TFII affects tumorigenesis, we con- 
ducted a transcriptome comparison using COUP-TFII-depleted PC3 
cells. Unbiased ingenuity pathway analysis indicated that COUP-TFII 
was crucial for the growth of human prostate cancer cells, an obser- 
vation reinforced by anchorage-independent growth assays and cell 
proliferation/viability tests (MTT assay) (Supplementary Fig. 5). 
Further gene-set enrichment analysis and comparison with our gene 
expression profile to a defined TGF-B response signature’? indicated 
a global enrichment of TGF-B-induced genes in the absence of 
COUP-TFI (Fig. 3a and Supplementary Fig. 6a). Given that TGF-B 
signalling is crucial for prostate cancer progression®”°, we examined 
whether COUP-TFII potentiated prostate tumorigenesis through 
TGF-6 signalling. In human prostate cancer cells, COUP-TFH served 
as a transcriptional inhibitor of TGF-f signalling (Supplementary 
Fig. 6b-d). Knockdown of COUP-TFII in PC3 cells substantially 
altered p21, p15 and cyclin D1 expression in response to TGF- stimu- 
lation (Supplementary Fig. 6e). Protein levels of p21 were also 
increased without TGF-f stimulation, presumably owing to autocrine 
TGE-B signalling in PC3 cells (Supplementary Fig. 6f). 

We postulated that overexpression of COUP-TFII enabled the 
indolent PTEN-null tumours to acquire metastatic potential through 
the destruction of a TGF-f-induced growth barrier. Indeed, in 
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Figure 2 | Prostate-specific overexpression of COUP-TFII promotes 
tumorigenesis in PTEN mutated mice. a, Scheme of conditional 
overexpression of COUP-TFII (Rosa26-lox-STOP-lox-COUP-TFI; 
COUP-TFI"*) in a PTEN spontaneous tumour model. COUP-TFIIO”* 
mice contain a single copy of a mini gene, consisting of a CAGGS promoter, a 
loxP-STOP-loxP (LSL) cassette and Myc-tagged COUP-TFII complementary 
DNA, knocked into the ROSA locus. In the presence of PB-Cre recombinase, 
the LSL cassette is excised, allowing COUP-TFII expression in the prostate 
epithelium. b, H&E-stained sections of representative anterior prostate at 4 
and 12 months of age in Pten?“*~ and Pten’©*’-; COUP-TFIC* mice. 


Arrows indicate PIN. c, H&E-stained sections of anterior prostate at 24 weeks 
of age in Pten’© ’~ and Pten’° ’ ; COUP-TFIO”* mice (top). 
Immunohistochemistry of o%-smooth muscle actin (SMAa) (middle) and AR 
(bottom) expression. d, Immunohistochemical staining of AR expression in 
lumbar lymph nodes (LN) and lung from 6-month-old Pten?°“~ and 
Pten’°’—; COUP-TFII°?™’* mice. e, Semi-quantitative results of Ki67 and 
CD31 staining of prostate tumours from 6-month-old Pten’°~’~ and 
Pten’°-/~; COUP-TFIO”* mice n = 6 (b-e); *P < 0.05. Data are mean and 
s.e.m. Scale bars, 100 ttm (b, d) and 50 um (c). 
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Figure 3 | COUP-TFII inhibits TGF-B signalling in prostate cancer cells. 

a, Heat map of microarray results shows the enrichment of TGF-B downstream 
target genes in COUP-TFII-depleted PC3 cells. siCOUP-TFII denotes COUP- 
TFII short interfering RNA (siRNA); ctrl denotes control. b, Western blot 
analysis of core regulators and downstream targets of TGF-f signalling, as well 
as p53, in anterior prostates from 24-week-old mice as indicated. c, The 
correlation between COUP-TFII and TGF-B signalling targets (p21 and cyclin 
D1) in patient prostate tumours and normal prostate counterparts are shown in 


Pten?°; COUP-TFIIC“’* mice, the TGF-B-induced growth barrier 
was significantly attenuated after increased COUP-TFII expression 
(Fig. 3b). Overexpression of COUP-TFII profoundly neutralized on- 
cogene-induced senescence, as judged by the reduction of senescence- 
associated f-galactosidase staining (Supplementary Fig. 6g), whereas 
ablation of COUP-TFI in PTEN-null mice enhanced the acquisition 
of hyperactivity of TGF-B signalling and a marked induction in cel- 
lular senescence. Furthermore, the clinical relevance between COUP- 
TFII levels and TGF-B signalling activities was established by a signifi- 
cant correlation between COUP-TFII and p21/cyclin D1 expression in 
prostate tumours, but not in normal prostate epithelium counter- 
parts, in a cohort of 36 patients with highly aggressive prostate cancer 
(Fig. 3c). 

Next, we extended our findings by querying the gene expression 
profiling data set GSE10645 (ref. 21) that contains 596 primary human 
tumours with prostate cancer-specific death information, to comple- 
ment our TMA results. Using gene expression profiles obtained in PC3 
cells, we found a total of 80 COUP-TFIl-regulated genes represented 
in GSE10645 (Methods, Supplementary Fig. 7 and Supplementary 
Table 2); the genes were used to score each of the human tumours 
for COUP-TFII activity. We observed distinct differences in the time 
to prostate-cancer-specific death, although no differences in the time 
to prostate-specific antigen (PSA) recurrence, among the patients with 
higher versus lower COUP-TFII activity (Fig. 3d and Supplementary 
Table 3-5). Multivariate Cox model analysis indicated that the PTEN, 
SMAD4, p21, cyclin D1 and COUP-TFII signature (Methods) all 
independently provided predictive power to stratify patients into high- 
and low-risk groups (Fig. 3e). Plotting the combined transcript levels 
of PTEN, SMAD4, p21 and cyclin D1 together with the higher 
COUP-TFII signature in prostatectomies increased the predictive 
power from that of either gene set alone (Fig. 3f), illustrating a 
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regression plots (n = 36). d—f, Association of COUP-TFII gene signature with 
disease-specific death in patients with prostate cancer using data set GSE10645. 
d, Kaplan-Meier plot, with top third, middle third and bottom third of patients 
representing COUP-TFI activities from high to low (P value by log-rank test). 
e, Multivariate Cox regression analysis of COUP-TFI signature and the 
transcription levels of PTEN, SMAD4, p21 and cyclin D1. f, Kaplan-Meier plot 
of patients grouped using the combination of COUP-TFII and PTEN/SMAD4/ 
cyclin D1/p21 gene sets. 


significant genetic cooperation between COUP-TFII and TGF-B 
signalling in PTEN-null human prostate cancer tumours. 

Because depletion of COUP-TFII failed to affect the levels of 
SMAD2/3/4, the receptors or the activated form of phosphorylated- 
SMAD2/3 (Supplementary Fig. 8), we investigated whether COUP- 
TFII could interact directly with any of these SMAD proteins. Indeed, 
COUP-TFII strongly associated with SMAD4 in cells (Supplementary 
Fig. 9a-c) and in patient tumour specimens (Fig. 4a). Overexpres- 
sion of mutant COUP-TFII-M2, which was unable to associate with 
SMAD4, failed to inhibit TGF-f signalling, indicating that the COUP- 
TFII-SMAD4 interaction was essential for the COUP-TFII effects on 
TGF-B signalling (Supplementary Fig. 9d-f). Mechanistically, our 
data indicated that COUP-TFII sequestered SMAD4 from binding to 
TGF-B-target gene promoters in cells (Supplementary Fig. 10) and in 
tumours containing higher levels of COUP-TFII (Fig. 4b). 

As supporting evidence for the counteraction between COUP-TFII and 
SMAD4, knockdown of SMAD4 in PC3 cells alleviated the proliferative 
defects exerted by COUP-TFII depletion (Supplementary Fig. 11). 
Furthermore, we used a SMAD4 conditional knockout mouse” to 
examine the tumour spectrum in Pten?--; COUP-TFIF~’— and 
Pten’°-’~; COUP-TFI°”* mice. Compared with hyperplasia or low- 
grade PIN lesions observed in 5-month-old Pten’°-/-; COUP-TFIF'C /— 
mice, ablation of SMAD4 in COUP-TFII-null mice restored the full spec- 
trum of tumour development as evidenced by the abundance of Ki67- 
labelled nuclei, the appearance of microinvasive tumours and the similar 
incidences of lymph node metastasis between Pten?-; Smad4eeo/— 
and Pten’°~/~; COUP-TFIP° ; Smad4°"/—_ mice (Fig. 4c, d and 
Supplementary Figs 12-14). Moreover, overexpression of COUP-TFII 
in Pten’~’; Smad4°~~’~ mice did not further aggravate the invasive 
phenotypes displayed by the Pten”“~’~; Smad4°°~~ mice (Fig, 4c, d and 
Supplementary Figs 12-14). 
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Figure 4 | COUP-TFII interacts with SMAD4 to modulate TGF-B 
signalling. a, Co-immunoprecipitation (IP) analysis of COUP-TFII and 
SMAD4 association in human prostate tumour specimens. IB, immunoblot. 
b, Cell lysates were prepared from prostate tumours in Pten’<~’~ and 
Pten’©’~; COUP-TFII°”’* mice (6 months of age, n = 3), and subjected to 
chromatin immunoprecipitation analysis of SMAD4 binding at the p21 or 
cyclin D1 promoter. c, d, Analyses of the lymph node metastasis (c) and SMAo 
expression (d) in prostrate tumours from Pten’°/, Pten’°-/~; COUP- 
TIO”, ten’ ’ ; COUP-TFII’” ’~ mice with or without SMAD4 deletion 
(5 months of age). e, Model of COUP-TFII in prostate tumorigenesis. PTEN 


In summary, our findings highlight a crucial role for COUP-TFI in 
driving full malignant progression of PTEN-null prostate tumorigenesis 
by counteracting TGF-B negative feedback signalling (Fig. 4e). As 
COUP-TFII is a member of the nuclear receptor family, the activity of 
which can be regulated by small molecules”, our studies provide a 
potential new drug target for the intervention of metastatic human 
prostate cancer. 


METHODS SUMMARY 

Tissue microarrays used in this study were described previously~*”’. All animal 
experiments were approved by the Animal Center for Comparative Medicine at 
Baylor College of Medicine. Detailed materials, methods and statistical analysis are 
provided in the Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Immunohistochemistry and quantitative reverse transcriptase PCR analysis. 
Tissue microarrays used in this study were described previously**”’. Samples were 
procured from radical prostatectomies of 416 patients, who received no adjuvant 
therapy such as radiation or hormonal therapy. Other patient characteristics 
were as described previously****. Immunohistochemical analyses were done as 
described**** using a specific anti-COUP-TFII antibody (Perseus Proteomics). 
Samples were scanned using a Bliss automated slide scanner to generate high- 
resolution digital images. The expression of COUP-TFI protein was scored and 
quantified as described previously~*”. In brief, the slides were digitized and nu- 
clear COUP-TFII protein expression was scored and quantified based on a mul- 
tiplicative index of the average staining intensity (0-3) and extent of staining (0-3) 
in the cores, yielding a 10-point staining index that ranged from 0 (no staining) to 
9 (extensive, strong staining). Quantitative PCR (qPCR) was performed on RNAs 
isolated from patients with prostate cancer (n = 36) undergoing a radical prosta- 
tectomy for clinically localized prostate cancer with clinical characteristics as 
previously described’. The tissues were >70% cancerous based on histopatholo- 
gical analysis. 

Animal experiments. All experiments were approved by the Animal Center for 
Comparative Medicine at Baylor College of Medicine. COUP-TFIP”*"* mice’ 
were previously generated by our laboratory, and COUP-TFII°”* mice were 
generated by our group using an approach as previously described’”. Pten!** 
mice* were obtained from H. Wu. The ARR2PB-Cre (PB-Cre) transgenic mice** 
were from F. Wang. The Smad4!le* mice? were from E. J. Robertson. PB-Cre, 
COUP-TFIF°*"°", COUP-TFIO* and Pten!*"* mice were backcrossed with 
C57BL/6 mice for at least five generations, and maintained according to the 
National Institutes of Health Guide for the Care and Use of Laboratory. COUP- 
TFII-floxed or COUP-TFII°’* mice were first crossed with Pten-floxed or 
Smad4-floxed mice. The resulting compound mice were then crossed with PB- 
Cre mice for conditional knockout or overexpression of COUP-TFII in the pro- 
state epithelium. 

Expression plasmids and siRNA. The full-length human COUP-TFII cDNA was 
cloned into pMSCV-puro/neo (Clontech), pcDNA5 (Flag tag) (Invitrogen) and 
pcDNA3.1 (Myc tag) (Invitrogen) to generate COUP-TFII expression plasmids. 
Overlapping PCR with primers was used to generate Flag-COUP-TFII-M1 and 
M2 constructs, which are inserted in the KPNI/Nhel sites of pcDNAS. Expres- 
sion plasmids for epitope-tagged SMAD2/3/4, glutathione S-transferase (GST)- 
SMAD4 and the SMAD-binding element (SBE)-Luc reporter were described 
previously’. COUP-TFII-specific short hairpin RNA (shRNA) constructs were 
generated previously”. In brief, the shRNA for COUP-TFII (5'-AGGTAACGTGA 
TIGATTCAGTATCTTA-3’ and 5'-AGCTCTTGCTTCGTCTCCC-3’) was 
cloned into pSUPER.retro (OligoEngine). SMART pool or single siRNA duplexes 
targeting COUP-TFII, Smad4 and control non-target siRNA were purchased from 
Dharmacon. 

Cell culture and reagents. PC3, VCaP, LNCaP and DU145 cells were obtained 
from the ATCC and cultured in DMEM or RPMI supplemented with 10% fetal 
bovine serum. RWPE-1 cells (ATCC) were cultured with keratinocyte serum- 
free medium (K-SFM; Invitrogen) with bovine pituitary extract and human 
recombinant EGF. Cells were transfected with siRNA duplexes (40-80 nM) using 
lipofectamine (Invitrogen) or Dharmacon transfection (Dharmacon) reagents 
according to the manufacturer’s instructions. COUP-TFII retrovirus was gene- 
rated and transfected as previously described". In brief, cells were transduced with 
retrovirus and cultured with 2 mg ml ' puromycin or 400 mg ml‘ neomycin and 
the selection was stopped as soon as the non-infected control cells died off. Cell 
proliferation assay was performed using CellTiter 96 non-radioactive cell proli- 
feration assay (MTT) kit (Promega) assay according to the manufacturer’s instruc- 
tions. Results were calculated based on three independent experiments and 
statistical significance was determined by two sided Student's t-test. 

Luciferase reporter assay. Reporter assays were carried out as described''. In 
brief, 20-24h after transfection, cells were treated with TGF-B (10 ng ml!) or 
$B431542 compound (20 nM) for 24h. Cells were then collected for measurement 
of luciferase and B-galactosidase activities. All assays were done in duplicate and all 
values were normalized for transfection efficiency against f-galactosidase acti- 
vities. Luciferase activity was determined with the Promega luciferase assay kit. 
Results were quantified based on three independent experiments and statistical 
significance was determined by two sided Student's t-test. P values less than 0.05 
were considered significant. 

RNA isolation and qPCR. Total RNA was extracted using TRIzol followed by the 
RNeasy mini kit (Qiagen) cleanup and RQ1 RNase-free DNase set treatment 
(Promega) according to the manufacturer’s instructions. First-strand cDNA was 
synthesized using 2 jg of total RNA and Superscript II (Invitrogen). TaqMan 
universal master mix reagents and inventoried primer/probe mixtures (Applied 
Biosytems) were used for the reaction. Standard curves were generated by serial 
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dilution of a preparation of total RNA, and all messenger RNA quantities were 
normalized against 18S RNA using ABI ribosomal RNA control reagents. The 
primers/probes used in this study are as follows: COUP-TFII (Mm00772789_m1), 
AR (Hs00171172_m1), probasin (Mm00444381_m1), NKX3.1 (Mm00440479_m1) 
and p21 (Hs00355782_m1). A Student’s t-test was used for statistical analysis of 
qPCR results, and P values less than 0.05 were considered significant. 

Histology and immunohistochemistry. The ventral, dorsolateral and anterior 
prostate lobes were dissected, fixed in 4% formaldehyde in PBS for 24h, processed 
and embedded in paraffin. Sections (7 ym) were processed for H&E staining. 
Prostate hyperplasia is characterized by the proliferation of luminal cells with no 
cytological atypia, but contain small foci with two or three layers of cells. The PIN 
lesions were graded using the nomenclature and criteria developed previously”’. In 
brief, high-grade PIN is characterized by an intraglandular proliferation of crow- 
ding cells with atypia, and cribriform formation or the development of multi- 
layered solid glandular structures. Invasive adenocarcinoma is characterized by 
the proliferation of atypical cells that break the basal membrane and invade 
through the prostatic stroma. The quantitative results of tumour progression are 
from two random slides of each mouse in a total 10 pairs of the entire animal 
cohort. For immunohistochemistry, the slides were processed using a citrate- 
buffer-based antigen retrieval and the avidin-biotin peroxidase immunohistoche- 
mistry method. Sections were counterstained with 4’ ,6-diamidino-2-phenylindole 
(DAPI; Sigma-Aldrich). Quantification of tumour malignancy was done as follows: 
for Ki67 staining, positive cells were counted after imaging with a Zeiss Axioplan 
microscope. Three random slides for each animal were counted. The proliferative 
rate was determined by counting the proliferating cells (Ki67-positive) in a total of 
100 cells, and the statistical significance was determined with a two-tailed Student’s 
t-test. For lymph node metastasis, we performed CK8 or AR immunostaining in 
lymph nodes and recorded the percentage of mice that were positive for CK8 or AR, 
which we considered as lymph node metastasis. For invasive adenocarcinoma 
identification, we performed SMAo immunostaining. If the gland or acini displayed 
a loss or breakdown of the continuous layer of basal membrane (SMAc-positive) 
and the appearance of AR- or CK8-positive epithelial cells invading into the stromal 
compartment, we consider it as microinvasive cancer. The statistical significance 
between groups was determined with a two-tailed Fisher extract test. 

Western blot and chromatin immunoprecipitation assay. Total proteins were 
extracted from cells following standard protocol’. Nuclear proteins were extracted 
using the NE-PER kit (Pierce). Protein concentration was measured using the 
BCA protein assay kit (Thermo Scientific). The primary antibodies used in this 
study were as follows: COUP-TFII (Perseus Proteomics), -actin (Santa Cruz), 
phophorylated-H3 (Cell Signaling), PCNA (Santa Cruz), phospho-AKT403 (Cell 
Signaling), pan AKT (Cell Signaling), SMAD4 (Santa Cruz), p21 (BD), phospho- 
SMAD2 (Cell Signaling), SMAD2 (Zymed), SMAD3 (Zymed), cyclin D1 (Sigma), 
p15 (Santa Cruz) and p53 (Santa Cruz). Horseradish peroxidase (HRP)-conjugated 
secondary antibodies were purchased from DAKO. Signals were visualized with the 
Super Signal West Pico Chemiluminescent Substrate kit (Pierce). Chromatin 
immunoprecipitation assays were performed as described as previously’! according 
to the protocol provided by Millipore. All the results are obtained from three 
independent samples and statistical significance was determined by a two-sided 
Student’s t-test. 

Immunoprecipitation and western blot analysis. Immunoprecipitation was 
performed as described previously’. Endogenous or epitope-tagged proteins were 
immunoprecipitated from cell lysates by the appropriate antibody affinity gel as 
indicated in the text and figure legends. After extensive washes, immunoprecipi- 
tated proteins were eluted in SDS sample loading buffer (Bio-Rad), separated by 
SDS-PAGE, transferred to nitrocellulose (Pierce) and detected in western blots 
with appropriate primary antibodies coupled with HRP-conjugated secondary 
antibody by chemiluminescence (Pierce). 

GST fusion protein, in vitro protein binding and pull-down assays. GST fusion 
proteins were prepared using a commercial kit (Amersham Pharmacia Biotech). 
Target protein was pre-cleared with 5 1g of GST protein for 1 h and then incubated 
with 2 ug of various GST-fusion proteins for 2h in the binding buffer (50 mM 
Tris-HCl, pH7.5, 120mM NaCl, 2mM EDTA, 0.1% NP40). Proteins bound to 
GST-fusion proteins were retrieved on glutathione-sepharose beads, separated by 
SDS-PAGE and examined by western blot. 

Electrophoretic mobility shift assay. Nuclear proteins from PC3 cells treated 
with TGF-B for 1h were extracted using NE-PER kit (Pierce). Complementary 
oligonucleotides corresponding to the p21 SBE and its mutants were annealed and 
end labelled with [y-*’P]ATP. The binding reactions were performed and analysed 
on a 5% non-denaturing gel. For supershift studies, 1 jig of antibody against 
SMAD4 (Santa Cruz) was preincubated for 20min on ice, and DNA-protein 
complexes were visualized by autoradiography. 

Microarray analysis. For microarray analysis, total RNA was isolated from con- 
trol and COUP-TFII-knockdown PC3 cells. The Microarray Core at Baylor 
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College of Medicine performed the microarray hybridization with Affymetrix 
Human Genome U133 Plus 2.0. Gene information for all probes was annotated 
based on ‘Human133_2.annot.csv’ downloaded from the Affymetrix website. Raw 
data can be found in the GEO database as accession number GSE33182. The list of 
significance was operated by setting a false discovery rate (FDR) threshold at level 
of 0.05. All differentially expressed gene lists generated as described above were 
further analysed with the Ingenuity Pathways Analysis program (http://www. 
ingenuity.com/index.html) to identify canonical pathways, and molecular and 
cellular functions enriched in the related gene lists. 

Clinical outcome analysis using GSE1064 data set. We defined a gene expres- 
sion signature for COUP-TFII activity using an external human tumour data set 
and determined the correlation of COUP-TFII signature patterns with time to 
prostate cancer-specific death in patients. The COUP-TFII gene signature was 
derived from our own gene expression profile data set, of cells with or without 
COUP-TFII knockdown (GEO accession GSE33182, from ref. 21). The COUP- 
TFII gene signature (defined using fold change >1.5 for each COUP-TFII siRNA 
sample profile versus each control sample profile) consisted of 1,209 Affy probe 
sets (representing 829 unique genes) induced with COUP-TFII knockdown and 
1,328 probe sets (891 genes) repressed with COUP-TFII knockdown (that is, the 
expression of 891 genes is induced by COUP-TFII, and 829 genes are repressed by 
COUP-TFII; Supplementary Data File). We used the previously described ‘t-score’ 
metric’ °° to define the COUP-TFII gene signature within profiles from an 
external human tumour data set (GEO accession GSE10645, using the ‘human 
cancer panel’, or GPL5858 platform). In brief, the t-score was defined for each 
external profile as the two-sided t-statistic comparing, within the profile, the 
average of the COUP-TFIl-induced genes with the average of the COUP- 
TFll-repressed genes (genes within the GSE10645 data set were first centred to 
standard deviations from the median). The GSE10645 mRNA profile data set 
consisted of genes in a focused panel of 488 (which did not include COUP-TFI 
mRNA itself), of which 80 were represented in our COUP-TFII signature; for the 
80 genes (47 induced by COUP-TFII siRNA and 33 repressed), the t-score con- 
trasted the patterns of the COUP-TFII-induced genes against those of the COUP- 
TFll-repressed genes, to derive a single value denoting coordinate expression of 
the two gene sets (Supplementary Fig. 7). The univariate Cox proportional hazards 
analysis, multivariate Cox analysis, and Kaplan-Meier survival analysis were used 
to determine the correlation of COUP-TFII signature with patient outcome as 
described below. Furthermore, we performed a univariate analysis for each feature 
in the GSE10645 data set, and found the overall FDR to be reasonable. Out of 502 
total features represented on the focused array, 131 had a univariate Cox P < 0.01, 
yielding an estimated FDR of (502 X 0.01)/131 = 3.8%. This indicates that there 
are widespread correlation patterns with outcome represented in the data set, 
similar to what one might find for other cancer profiling data sets. We also tested 
the ability ofa random gene signature to predict outcome, using simulation testing 
in which t-scores were computed for each of 10,000 randomly generated signa- 
tures of the same gene number as the actual signature; for 95.8% of the random 
signatures, the correlation with worse patient outcome (as assessed by univariate 
Cox B value) did not exceed that for the actual signature. 

Correlation of COUP-TFII signature with PSA recurrence. In addition to the 
GSE10645 (ref. 21) data set, we examined the signature in three other gene array 
data sets of human prostate tumours, from refs 37-39. In these other data sets, the 
outcome endpoint was PSA recurrence, whereas that for the GSE10645 data set 
was prostate-cancer-specific death. For multiple human tumour gene array data 
sets including GSE10645, we noted significant overlap between the genes 
repressed by COUP-TFII knockdown and the genes associated with worse out- 
come (Supplementary Table 4). However, when computing signature t-scores for 
each of the four data sets, the signature correlated with worse outcome for 
GSE10645 (ref. 21) (using prostate-cancer-specific death as the outcome measure) 
and ref. 37 data sets, but not with the refs 38 and 39 data sets, and not with time to 
PSA recurrence in the ref. 37 data set (Supplementary Table 5). Therefore, we 


conclude that the signature correlates with shorter time to prostate-cancer-specific 
death, with time to recurrence event having a weaker association. 

Analysis of the enrichment of TGF-B-induced genes in COUP-TFII-depleted 
PC3 cells. A one-sided Fisher’s exact test determined the significance of gene 
overlap between our COUP-TFII gene signature (as described above) and a previ- 
ously generated gene signature of TGF-f activity (as reported previously’’). Both 
data sets were generated using the Affymetrix U133 Plus 2 platform; therefore, the 
mapping between the two gene signatures was based on Affymetrix probe set 
identifier, with the gene population being the entire set (~54,675) represented 
on the gene array. 

Statistical analysis. All experiments were performed using 3-15 mice or three 
independent repeated experiments from cells. Data are presented as mean and 
s.e.m. Spearman correlation coefficients were used to evaluate the relationships 
between COUP-TFII and clinicopathological variables or gene expression. The 
predictive value of COUP-TFII expression univariately with other clinical and 
pathological variables was analysed using the Cox proportional hazards regression 
model and the hazard ratio, and 95% confidence intervals were computed. The 
multivariate Cox proportional hazards model with the five-gene signature was 
used to estimate the coefficients of individual genes, which combined the five-gene 
expression levels into an integrated risk score model defined. Kaplan-Meier sur- 
vival curves for different levels of COUP-TFII or COUP-TFII signature were also 
plotted. All analyses are performed with statistical software SPSS 19.0 (IBM, SPSS 
Statistics) or WinStat for Excel (R. Fitch software). Statistical significance was 
determined by a Student's t-test, Spearman correlation coefficients test, log-rank 
test or Fisher’s exact test. For all statistical tests, the 0.05 level of confidence (two- 
sided) was accepted for statistical significance. 
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Recombination-restarted replication makes 
inverted chromosome fusions at inverted repeats 


Ken’Ichi Mizuno!, Izumi Miyabel, Stephanie A. Schalbetter!, Antony M. Carr! & Johanne M. Murray! 


Impediments to DNA replication are known to induce gross chromo- 
somal rearrangements (GCRs) and copy-number variations (CNVs). 
GCRs and CNVs underlie human genomic disorders’ and are a fea- 
ture of cancer’. During cancer development, environmental factors 
and oncogene-driven proliferation promote replication stress. Re- 
sulting GCRs and CNVs are proposed to contribute to cancer deve- 
lopment and therapy resistance’. When stress arrests replication, the 
replisome remains associated with the fork DNA (stalled fork) and is 
protected by the inter-S-phase checkpoint. Stalled forks efficiently 
resume when the stress is relieved. However, if the polymerases dis- 
sociate from the fork (fork collapse) or the fork structure breaks 
(broken fork), replication restart can proceed either by homologous 
recombination or microhomology-primed re-initiation*’. Here we 
ascertain the consequences of replication with a fork restarted by 
homologous recombination in fission yeast. We identify a new me- 
chanism of chromosomal rearrangement through the observation 
that recombination-restarted forks have a considerably high propen- 
sity to execute a U-turn at small inverted repeats (up to 1 in 40 
replication events). We propose that the error-prone nature of re- 
started forks contributes to the generation of GCRs and gene amplifi- 
cation in cancer, and to non-recurrent CNVs in genomic disorders. 

In eukaryotes, multiple origins are licensed but only a subset fire. 
If one fork collapses, replication is completed by a converging fork’. 
Alternatively, if both converging forks collapse, dormant origins can 
fire to rescue the situation’. However, when converging forks collapse 
without an intervening dormant origin, that is, at a fragile site®”, or if 
a single fork collapses at a unidirectionally replicated locus"®, one re- 
plisome will likely be rebuilt by HR. To study replication-fork collapse 
and restart we use a programmed replication-terminator sequence 
(RTS1) to arrest the replisome at a defined genomic locus in fission 
yeast'!”, Fork arrest at RTS1 is controlled by regulating rtf1* trans- 
cription"’. Rtfl, a Myb-like DNA-binding protein, is required for arrest 
at RTS1. After induction, >90% of forks arrest at RTS1 and require HR 
proteins to restart’. 

In Schizosaccharomyces pombe collapsed forks restart by an HR- 
dependent, but double-strand-break (DSB)-independent, mecha- 
nism’*’. Our model (Supplementary Fig. 1) suggests that HR proteins 
associate with the nascent strand behind the collapsed fork and sub- 
sequent strand invasion at the collapse site facilitates accurate HR- 
dependent fork restart with the correct template. However, if a DNA 
sequence homologous to the collapse site is nearby, an erroneous 
strand invasion can occur such that replication reinitiates ectopically. 
This leads to non-allelic homologous recombination (NAHR)"”. 
When the homologous sequences are in an inverted-repeat orienta- 
tion, NAHR associated with inaccurate restart results in acentric and 
dicentric isochromosomes!*. We also observed that, when the fork 
barrier sequence formed the flanking regions of a small palindrome”, 
GCR rates increased ~tenfold (contrast the two constructs RuraR and 
RuiuR, in which a second inverted copy of ura4* has been inserted; 
Supplementary Fig. 2). The main distinction between the two con- 
structs is that RuiuR contains RTS/ in context of the 5.3-kilobase (kb) 
palindrome as opposed to an inverted repeat separated by 1.8kb 


(RuraR). We thus speculated that, upon NAHR, branch migration of 
the invading strand (which is not possible in the inverted-repeat con- 
struct) formed a single Holliday junction at the palindrome centre, 
which drove the increased chromosome rearrangement. 

To prevent the predicted half-crossover migrating in RuiuR to the 
palindrome centre we replaced 550 base pairs (bp) of the centromere- 
proximal ura4* gene with 0.2 or 1.8 kb of his3*, creating RpallR and 
Rpal2R, respectively (Fig. 1a). To prevent any possibility of rearrange- 
ment by NAHR we created two further constructs in which the telo- 
mere-proximal RTS1 sequences of RpallR and Rpal2R were replaced 
with three copies of the ribosomal DNA fork barrier sequence, TER2/3, 
to create TpallR and Tpal2R. In these constructs, “I” represents the 
ribosomal DNA barrier, ‘R’ represents RTS1 and ‘pal’ refers to the 
presence of the inverted 1.2-kb repeat. TER2/3 serves simply to pause 
the converging fork, allowing more time for the RTS1-collapsed fork to 
restart. TER2/3 differs from RTS1 in sequence and arrests forks in an 
Rtfl-independent fashion’*”* (Supplementary Fig. 3a, b). Unlike RTS1, 
where forks collapse and require HR to rebuild the replisome for 
restart, forks pause transiently at TER2/3, do not require HR for re- 
sumption, and the arrest site does not accumulate Rad52. GCRs are 
thus not induced (Supplementary Fig. 3c-f). 

To assay for GCRs, fork arrest was induced at RTS1 by inducing Rtfl 
(TER2/3 arrest is constitutive) and genomic DNA was analysed by 
Southern blot. We predicted no GCRs in strains with a single RTS1 
(TpallR and Tpal2R) and were interested in establishing whether 
double-RTS1 constructs (RpallR and Rpal2R) appreciably reduced 
GCR levels (Fig. 1a). Notably, all four constructs generated GCRs in 
an RTS1 fork-arrest-dependent manner (Fig. 1b, c). Double-RTS1 sys- 
tems accumulated 25-30% GCRs, similar to that observed in RuiuR, 
whereas single-RTS1 strains showed ~5-15% rearrangement. 

NAHR between RTS1 sequences occurring upon restart should pro- 
duce dicentric chromosomes with an expected BglII fragment of 
15.2 kb (Rpall R) or 15.8 kb (Rpal2R); however, the observed fragment 
lengths were 14.8 kb and 18 kb, respectively. These correspond to dou- 
ble the size from the centromere-proximal BglII site to the palindrome 
centre. Identical-size fragments are observed in the corresponding 
single-RTS1 constructs. These data suggest a novel mechanism of 
chromosomal rearrangement, in which the collapsed replication fork 
resumes accurately with the correct template, but later reverses the 
orientation of DNA replication (U-turn) as it replicates through the 
palindrome centre. This leads to isodicentric chromosome formation. 

To characterize the effect of palindrome size in promoting restarted- 
fork-dependent GCRs, a series of constructs was made in which the 
palindrome size (P(bp)) varied between P(74) and P(2,400), but its 
centre of symmetry remained a constant distance from the site of 
fork restart (Fig. 2a). All constructs contained the 14-bp interrupting 
sequence at the palindrome centre. To establish GCR levels, genomic 
DNA was analysed by Southern blot using two flanking probes, probe 
pA or pB. 

For P(2,400), pB revealed that ~ 10% of the DNA corresponded to the 
rearranged product, migrating at 14.4 kb (Fig. 2b). This is twice 7.2 kb, 
the distance from the palindrome centre to the centromere-proximal 
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Figure 1 | Alternative mechanism for inverted chromosomal fusion. 

a, Cartoon of double- and single-RTS1 constructs. Replication origins (ARS) on 
chromosome 3, their distance from ura4 and predicted efficiencies are shown. 
Cen denotes centromere 3. Concave light blue and dark blue boxes represent 
RTS1 or 3 X TER2/3 as indicated. Yellow arrows/boxes represent ura4 
sequences and green box represents his3 sequences. Black arrow indicates 
inverted repeats of the ura4 sequences. Open triangle shows 14-bp interrupting 
sequence at the palindrome centre. Red bars represent probes. B indicates BglII 
restriction site. Sizes of initial and predicted dicentric chromosome BgllI 
fragments generated by replication-template exchange are shown for each 
relevant strain. A kb indicates size of inserted unique sequence (green). b, A 
representative Southern blot of double- and single-RTS1 strains with arrest off 
or arrest on. Genomic DNA was digested with BglII and probed with pB. 

c, Quantification of rearranged fragment in b. Mean and + s.d. of values are 
calculated from at least three independent experiments. 


BglII site. pA revealed a similar proportion of a 5.4-kb fragment (twice 
2.7 kb, the distance between the palindrome centre and the telomere- 
proximal BgllI site; Supplementary Fig. 4). These rearranged products 
correspond, respectively, to dicentric and acentric isochromosomes 
(data not shown). As expected, P(0), which has no inverted repeats, 
showed no detectable rearrangement products, confirming that the 
14.4-kb and 5.4-kb signals do not represent replication intermediates 
or broken forks. All induced rearrangements were dependent on replica- 
tion-fork arrest at RTS1 and the percentage of GCR was dependent on 
palindrome size (Fig. 2b, c and Supplementary Fig. 4b, d). P(314) was the 
smallest palindrome allowing GCR detection by Southern blot analysis. 

Palindromes are prone to forming secondary structures, including 
cruciforms (double-strand DNA) and hairpins (single-strand DNA). 
Secondary-structure formation is influenced by interrupting-sequence 
size'*’”. To establish whether GCR formation was related to interrupting- 
sequence size, we used P(1,214) as a base construct and varied the inter- 
rupting sequence (Fig. 2d). A 7-bp interrupting sequence showed slightly 
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Figure 2 | Rearrangement frequency is dependent on the repeat size and 


interrupting-sequence size. a, Cartoon of constructs with varying repeat size. 
P(2,400), P(0) (no inverted repeat) and intermediates P(W) are indicated as in 
Fig. 1a. W represents the size of the whole palindrome in base pairs. X denotes 
the size of the ura4 fragment creating the inverted repeat. Grey box indicates 
heterologous sequence (V). The sum of X and V is always 1,200 bp. b, Southern 
blot analyses of P(W) strains for arrest off or arrest on. rtf1A indicates strain 
deleted for the rtfl gene. Genomic DNA was digested with BglII and probed 
with pB. c, Quantification of rearranged fragment in b. d, Cartoon of constructs 
varying interrupting sequence (IS) (P(1,200)IS(Y)), indicated as in Fig. 2a, in 
which Y represents the size (in base pairs) of the interrupting sequence. 

e, Southern blot analysis of P(1,200)IS(Y) strains for arrest off or arrest on. 
Southern blot was performed as described in Fig. 1b. f, Quantification of 
rearranged fragment in e, as described in Fig. 1c. Mean and = s.d. of values in 
cand f are calculated from at least three independent experiments. 


higher GCR levels than a 14-bp interrupting sequence. A 28-bp inter- 
rupting sequence reduced levels by ~threefold, to 2%. This did not re- 
duce further when the interrupting sequence was increased to 250 bp 
(Fig. 2e, f). These results indicate that a potential for structured DNA 
formation promotes restarted fork U-turn, but is not essential. 

The data from Fig. 1b, c suggested that the distance of the palin- 
drome centre from the site of restart influences the U-turn frequency. 
To clarify this, a further series was constructed in which different sizes 
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of heterologous sequence separated a 1.2-kb palindrome and RTS1 
(Fig. 3a). The maximum amount of GCR was observed when the pa- 
lindrome directly abutted RTS1 (~8%). As the distance between the 
palindrome and RTSI1 was increased to ~1.5kb, the GCR level 
decreased ~fourfold, to 2%. Further extension of the distance did 
not result in further decreases (Fig. 3b, c). These data indicate that 
the restarted fork is initially highly error prone, but matures as it travels 
the first few kilobases. However, the constant rate of GCR observed in 
constructs separating the palindrome from RTS1 by >1.5 kb implies 
that such ‘matured’ forks are non-canonical and remain error prone. 

Although it has not been possible (owing to the limitations of our 
system) to establish whether recombination-restarted forks become 
error free over greater distances, it is notable that break-induced rep- 
lication forks analysed in Saccharomyces cerevisiae’* remain prone to 
replication slippage’? and template exchange with homologous chro- 
mosomes” over tens to hundreds of kilobases. Break-induced replica- 
tion initiates from a DSB and occurs outside of S phase in G2-arrested 
cells. Our recombination-restarted forks restart without a DSB inter- 
mediate during S phase. It is thus unclear how closely the two systems 
equate and whether similar replisome configurations underlie restarted 
replication in both systems. 

NAHR occurring during the restart event generates acentric and 
dicentric isochromosomes containing the intervening sequence origi- 
nally present between the inverted RTS1 repeats. Conversely, the iso- 
chromosomes generated by recombination-restarted forks executing 
a U-turn at the palindrome centre contain either two centromere- 
proximal or two telomere-proximal sequences (defined from the cen- 
tre of the inverted repeat/palindrome). Thus, we can establish the 
relative contribution of these two distinct mechanisms in generating 
isochromosomes from the double-RTS1 palindrome construct RpallR. 

Double-RTS1 constructs (Fig. 1a) potentially cause isochromosome 
formation by either NAHR or recombination-restarted fork U-turn. 
The derivatives with a single RTS1 sequence cannot undergo NAHR 
and only generate isochromosomes by recombination-restarted fork 
U-turn. We used Southern blot to distinguish these products (Fig. 4a). 
The main mechanism for the rearrangement for inverted RTS1 con- 
structs is recombination-restarted fork U-turn (Fig. 4b), with a minor 
contribution from NAHR. Therefore, the original palindromic RuiuR 
construct, which showed a much higher level of isochromosome for- 
mation compared with the original RuraR construct!”, does so because 
of an additional defect associated with forks restarted by HR. 

The junction consists of two sister chromatids fused at the repeat 
centre, suggesting that the recombination-restarted replisome per- 
forms a U-turn by exchanging template strands between the repeats. 
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Figure 3 | Fidelity of HR-restarted fork improves with distance. a, Cartoon 
of constructs of varying distance from the restart site (P(1,200)D(Z)) (as 
indicated in Fig. 2a). The whole palindrome is 1,214 bp. Z indicates distance (D) 
of the palindrome from RTS1 in kilobases. b, Southern blot analysis of 
P(1,200)D(Z) strains for arrest off or arrest on. Analysis was performed as 
described in Fig. 1b. c, Quantification of rearranged fragment in b, as described 
in Fig. 1c. Mean and + s.d. of values are calculated from at least three 
independent experiments. 
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This would be consistent with the nascent strand frequently dissocia- 
ting transiently from, and then re-annealing to, its template. Synthesis 
continuing on an incorrect inverted template would result in a ‘closed 
Y’ structure at the repeat centre. Intriguingly, large interrupting se- 
quences reduce, but do not eliminate, the rearrangement, suggesting 
that homology either with or without structured DNA can drive a 
U-turn. 

The genome-rearrangement profiles in cancer are complex, inclu- 
ding simple CNVs, chromothripsis (multiple linked rearrangements)”’, 
translocations and gene amplifications that often initiate from isochro- 
mosomes’. Similarly, rearrangements in genomic disorders include 
simple recurrent CNV caused by NAHR during meiosis (or occasion- 
ally in mitotic cells), inverted duplication deletions”, non-recurrent 
CNVs typified by microhomology (or no homology) at the junction”, 
and complex multi-junction events that, in some cases, suggest multiple 
contiguous replication errors*”*”. 

Our data show that recombination-restarted forks are error prone, 
with an unexpectedly high propensity (up to 1 in 40 events) to U- 
turn between short inverted repeats that can be separated by many 
kilobases. Likewise, recombination-restarted forks cause increased 
microhomology-dependent insertions and deletions**. Current mo- 
dels for the generation of replication-associated rearrangements al- 
most invariably assume a DSB as the initiating event, which subse- 
quently undergoes an incorrect choice of restart site on the basis of 
homology or microhomology. We propose that inaccurate replication 
from forks correctly restarted without a DSB intermediate also makes 
an important contribution to genome rearrangement. Once a fork is 
restarted at the correct sequence it is particularly prone to U-turn 
between inverted repeats. Although our physical assay can detect these 
events at inverted repeats of ~150 bp, the relationship between repeat 
size and frequency (Fig. 2c) suggests that shorter repeats will still 
generate an appreciable rate. 

Although we cannot directly establish whether recombination- 
restarted forks are responsible for genome rearrangements in human 
cells, our data predicts that isochromosome formation in cancer cells will 
be elevated at fragile sites, where replication forks are prone to collapse 
and low-origin density necessitates fork restart. Interestingly, isodicen- 
tric chromosome formation and subsequent breakage-fusion-bridge 
cycle-dependent rearrangements initiate gene amplification”, and fra- 
gile sites have been associated with amplification boundaries as well as 
other cancer-related GCRs**. Similarly, slippage at microhomology by 
recombination-restarted forks probably contributes to the frequent 
CNVs associated with cancer’””®. 

Equally, several classes of genomic disorders are compatible with a 
contribution from recombination-restarted forks: one well-characterized 
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Figure 4 | U-turn at palindrome centre is chief mechanism for inverted 
fusion in double-RTS1 constructs. a, Southern blot analyses of RuiuR, RpallR 
and TpallR for arrest off or arrest on. Genomic DNA was digested with BglII 
and probed with pB or pU3 (see Fig. 1a). Note that most of the rearrangement 
in Rpal1R detected by pU3 is acentric 6.6 kb. b, Model for error-prone 
progression of a recombination-restarted replication fork. Oval, blue concave 
box and yellow box denote replication origin, obstacle and repeat sequences, 
respectively. When a replication fork collapses, HR restarts the collapsed fork. 
However, the restarted fork is non-canonical and error prone, causing GCRs at 
inverted repeats (IR) owing to the execution of a U-turn. 
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rearrangement involves a triplicated segment embedded within a dupli- 
cation” mediated by small inverted low-copy repeats. ‘Duplication- 
inverted triplication-duplication’ involves two ‘breakpoint’ junctions, 
one within the repeats and a second showing microhomology. Such a 
rearrangement can be explained by two distinct events associated with a 
single-fork restart. Likewise, several other genomic disorders result from 
inverted duplication deletions”, which are predicted to be the stabilized 
events of breakage of an isodicentric chromosome during breakage- 
fusion-bridge cycles. 


METHODS SUMMARY 


Standard genetics and molecular biology techniques. Strains were constructed 
using standard genetic techniques*’. The S. pombe strains used in this study are 
listed in Supplementary Table 1. Culture conditions, genomic DNA preparation in 
agarose plugs, Southern blot techniques and the quantification of rearranged DNA 
were performed as described in refs 12 and 13. Genomic DNA was digested with 
150 units of BglII in the recommended buffer. Probes pA and pB are described in 
ref. 12 as Cen and Tel, respectively. Probe pU3 is a 550-bp fragment of ura4 ge- 
nomic DNA digested by EcoRV and Spel, probe ura is a 1.8-kb ura4 fragment, and 
probe ura45 is a HindIII Spel fragment comprising the wra4* and uraS* genes. 
TER2/3 ribosomal DNA fork barrier. Primers prr1-forward (5'-p-AATTCTAC 
TACTATTTTGTGCATTACCCTTACCTTTTTTTTC-3’) and prr1-reverse (5'- 
p-AATTGAAAAAAAAGGTAAGGGTAATGCACAAAATAGTAGTAG-3’) were 
annealed and ligated. The TER2/3 consensus sequence is underlined'*. The ligated 
DNA was digested with EcoRI and Mfel to eliminate inverted-repeated configuration 
and size-fractionated by agarose gel, and fragments of ~130 bp (3 tandem repeats) 
were used to replace the telomere-proximal RTS1. The construct was confirmed by 
sequencing. 
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Structure of the proton-gated urea channel from the 
gastric pathogen Helicobacter pylori 
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Half the world’s population is chronically infected with Helico- 
bacter pylori’, causing gastritis, gastric ulcers and an increased 
incidence of gastric adenocarcinoma’. Its proton-gated inner- 
membrane urea channel, HpUrel, is essential for survival in the 
acidic environment of the stomach’. The channel is closed at neut- 
ral pH and opens at acidic pH to allow the rapid access of urea to 
cytoplasmic urease*. Urease produces NH; and CO;, neutralizing 
entering protons and thus buffering the periplasm to a pH of 
roughly 6.1 even in gastric juice at a pH below 2.0. Here we report 
the structure of HpUrel, revealing six protomers assembled in a 
hexameric ring surrounding a central bilayer plug of ordered 
lipids. Each protomer encloses a channel formed by a twisted bun- 
dle of six transmembrane helices. The bundle defines a previously 
unobserved fold comprising a two-helix hairpin motif repeated 
three times around the central axis of the channel, without the 
inverted repeat of mammalian-type urea transporters. Both the 
channel and the protomer interface contain residues conserved 
in the AmiS/Urel superfamily, suggesting the preservation of 
channel architecture and oligomeric state in this superfamily. 
Predominantly aromatic or aliphatic side chains line the entire 
channel and define two consecutive constriction sites in the middle 
of the channel. Mutation of Trp 153 in the cytoplasmic constriction 
site to Ala or Phe decreases the selectivity for urea in comparison 
with thiourea, suggesting that solute interaction with Trp 153 con- 
tributes specificity. The previously unobserved hexameric channel 
structure described here provides a new model for the permeation 
of urea and other small amide solutes in prokaryotes and archaea. 

Treatment of H. pylori infection is becoming less effective as a result 
of increasing antibiotic resistance, suggesting that a specifically tar- 
geted approach to eradicate this organism would be beneficial*. Colo- 
nization of the acidic mammalian stomach by H. pylori depends on the 
presence of the inner-membrane protein HpUrel, making it a viable 
clinical target’. HpUrel is a proton-gated urea channel that is closed at 
pH7.0 and fully open at pH 5.0, enabling the rapid entry of urea into 
the bacterium*. Urease activity (Supplementary Fig. 1) buffers the 
periplasm to pH6.1, which is essential for the survival of H. pylori 
at acidic pH®”’ as a result of the neutralizing capacity of NH; and 
the generation of HCO; by periplasmic «-carbonic anhydrase. Con- 
versely, the closure of HpUrel at neutral pH prevents over-alkalization 
in the presence of millimolar urea, and consequent cell death®. 

The structure of HpUrel was determined using multiwavelength 
anomalous dispersion (MAD; Methods). The structure shows an 
arrangement of six protomers that form a compact hexameric ring 
(Fig. 1) about 95 A in diameter and 45 A in height. The centre of the 
hexamer is filled with an ordered lipid plug that forms an asymmetric 
bilayer with electron density for six lipid tails in the periplasmic leaflet 
and for 18 tails in the cytoplasmic leaflet. This central lipid plug is 
reminiscent of those reported for other membrane protein oligomers 


such as bacteriorhodopsin’. The main inter-protomer contacts are 
between TMH1 and TMH2 of one protomer and TMH3 and TMH4 
of a neighbouring protomer, a region with conserved residues that are 
probably important for assembly (Fig. 2a). Native gel electrophoresis 


Figure 1 | The HpUrel urea channel hexamer. a, Ribbon diagram of hexamer 
surrounding the lipids of the central bilayer (purple sticks). The C6 hexamer is 
generated from the three protomers of one asymmetric unit (green) by the 
crystallographic two-fold axis (second asymmetric unit in teal). b, Electrostatic 
potential at the periplasmic hexamer surface computed at pH 5.3, the pH at 
which the crystals were grown (red, —4kT/e, blue, +4kT/e). The electrostatic 
potential was calculated with the program APBSmem””. The red arrow 
pinpoints the entrance to one of six urea channels. 
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Figure 2 | Residues lining the channel. Residues conserved in the AmiS/Urel 
superfamily (aligned in Supplementary Fig. 11) are clustered in the channel and 
at the protomer interface. a, View from the periplasm showing the conserved 
residues of protomer A (gold ribbon) and the open channel of protomers B and 
C (green and blue ribbons, respectively) when only the protein backbone is 
shown. For clarity, only selected side chains are labelled. Residues conserved in 
all members of the AmiS/Urel superfamily are white; additional residues 


of HpUrel confirms that it is a hexamer (Supplementary Fig. 4), in 
contrast to the previously postulated trimer’®. An electron microscopy 
9-A projection map of a Bacillus cereus amide channel, also a member 
of the AmiS/Urel family, revealed a similar hexameric arrangement". 
Thus, the hexamer probably represents the physiological state rather 
than a crystallization artefact. 

Sequence analysis had predicted six transmembrane segments with 
periplasmic location of both amino and carboxy termini, generating 
two periplasmic loops (PLs), one between transmembrane helix 
(TMH)2 and TMH3 (PL1) and another between TMH4 and TMH5 
(PL2), as well as a short periplasmic C-terminal segment’. The crystal 
structure shows that each HpUrel protomer is a twisted bundle of six 
slightly tilted transmembrane helices whose inward-facing side chains 
define a central channel with a unique arrangement of residues (Fig. 2). 
Each protomer exhibits non-inverted three-fold pseudosymmetry as 
demonstrated by the high structural similarity after 120° and 240° 
rotations around an axis through the centre of the channel (Fig. 3). 
The repeating motif is a helical hairpin composed of a pair of anti- 
parallel TMHs connected by a short cytoplasmic loop. However, closer 
helix-helix packing is observed between helices from adjacent 
repeated motifs, namely TMH2 and TMH3, TMH4 and TMHS, and 
TMH1 and TMH6 (Fig. 2a). The mitochondrial ADP/ATP exchanger 
has a clear internal threefold tandem repeat of a sequence motif in its 
amino-acid sequence’, whereas the three-fold pseudosymmetry of 
HpUrel became apparent only after the crystal structure had been 
determined. Sequence alignments suggest that this motif is present 
in eubacteria and archaea (Supplementary Fig. 11), implying an 
ancient evolutionary origin. 

The channel is lined by conserved residues that identify the solute 
pathway in the AmiS/Urel channel superfamily, and specifically the 
pathway for urea through HpUrel. The beginning of TMH] is recessed 
within the helix bundle and starts with Met 1, whose ordered side chain 
was crucial for selenomethionine phasing. TMH1 contains 3 of the 11 
channel residues that are conserved in the AmiS/Urel superfamily 
(Fig. 2 and Supplementary Fig. 11), all facing inwards where they 
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conserved in the subset of known urea channels (including HpUrel) are yellow. 
The arrow points to the proton-sensing periplasmic loop 2 (PL2) of 
protomer A. b, View parallel to the membrane, with fully conserved residues 
(white) clustering in the middle of the channel pore, and residues additionally 
conserved in the urea channels in yellow. The red arrows show the regions of 
urea entry (top, periplasmic side) and exit (bottom, cytoplasmic side). 


define one side of the channel. The longest helix, TMH2, is near the 
lipid-filled centre, set back from the channel of the hexamer outside the 
urea pathway, with one conserved residue. This helix starts more than 
two helix turns earlier than predicted*. On reaching the periplasmic 
side, TMH2 continues for several helical turns until the disordered 
section of PL1, which contains the engineered His, tag. At this point, 
TMH? reaches a height above the bilayer that is similar to the height of 
PL2. TMH3 is situated inwards of TMH2 and is a major contributor to 
the urea pathway with four conserved residues in the channel, includ- 
ing Phe 84 and Tyr 88, which form part of two constriction sites in the 
channel (Figs 2 and 4). TMH3 contacts TMH2 of the same protomer as 
well as the cytoplasmic half of TMH2 from a neighbouring protomer. 
TMH4, although part of the repeating structural hairpin motif with 
TMH3, barely contacts TMH3 and instead is in close contact with 
TMHS. TMH4, PL2, TMHS, cytoplasmic loop 3 and TMH6 constitute 
the outer, bilayer-facing edge of the HpUrel hexamer. TMH5 con- 
tains five tryptophan residues, three of which (Trp 146, Trp 149 and 
Trp 153) are highly conserved. These line the urea path, with the 
aromatic side chains of Trp 149 and Trp 153 being the main compo- 
nents of the two constriction sites (Figs 2b and 4). In the middle of the 
membrane, the side chain of Glu177 of TMH6 is in a hydrophobic 
environment and is thus predicted to have an elevated pK, of 6.7 and to 
be protonated at pH5.3, the pH of crystallization. This residue is 
conserved only in the Urel family of AmiS/Urel members (Sup- 
plementary Fig. 12), implying a role in urea transport. It is located to 
the side of the channel and in a suitable position to hydrogen-bond to 
urea. Beyond the C-terminal end of TMH6, the last five residues of 
HpUrel are not helical but bend back towards PL2, where the 
C-terminal segment is tucked under PL2. 

There is probably no high-affinity urea-binding site in the channel, 
because there is no saturation up to 100 mM urea when expressed in 
oocytes*, and the half-saturation concentration in proteoliposomes 
has been reported to be 163 mM (ref. 10). Although neither structur- 
ally nor mechanistically related, DvUT from Desulfovibrio vulgaris, a 
homologue of the mammalian urea transporters, shows half-saturation 
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Figure 3 | Structural conservation of a two-helical hairpin motif. The 
internal three-fold axis of pseudosymmetry is oriented perpendicular to the 
bilayer (black line) and is the result of a threefold tandem non-inverted repeat 
ofa helical hairpin motif (TMH pairs 1 and 2, 3 and 4, and 5 and 6). In this side 
view, one HpUrel protomer (gold ribbon) has been overlaid on itself by 
rotations of 120° (green ribbon) and 240° (white ribbon) around the three-fold 
axis through the centre of the channel pore. The root mean squared deviations 
for pairwise overlays of the backbone atoms are 1.52 A for TMH1 and TMH2 
onto TMH3 and TMH4; 1.13 A for TMH3 and TMH4 onto TMHS5 and TMH6; 
1.76 A for TMH5 and TMH6 onto TMHI1 and TMH2, and 1.98 A when all are 
superimposed simultaneously. Structural conservation is stronger for the 
cytoplasmic half of the channel. Analysed with the program SymD”. 
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Figure 4 | Views of the HpUrel channel traversed by urea. a, Side view with 
periplasm on top, showing the shape of the channel in yellow and residues 
conserved in the Urel family (Supplementary Fig. 12). Constriction site 1 is 
formed largely by the side chains of Leu 6, Phe 84 and Trp 149. On the other 
side of Glu 177, constriction site 2 is defined by the side chains of Leu 13, Thr 87, 
Tyr 88, Leu 152 and Trp 153. TMH1 and TMH2 have been removed for clarity. 
Blue arrows indicate the positions of cross-sections shown in b. b, Close-ups of 
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concentrations of 2.3mM urea’. However, it is constructed en- 
tirely differently, featuring a long (16-A), narrow hydrophobic channel 
with inverted two-fold symmetry also found in the mammalian urea 
transporter UT-B™. Both of these contain phenylalanine residues at the 
entry and exit points of the channel. 

HpUrel has much larger periplasmic loops than other members of 
the AmiS/Urel superfamily, and proton-gated closure of the channel, 
affected by mutations of histidine residues or carboxylates in the peri- 
plasmic domain", could be due to loop occlusion of the periplasmic 
vestibule. A similar effect has been shown for the FocA formate chan- 
nel'®, but here histidine protonation closes, rather than opens, the 
channel. For HpUrel at pH5.3, PL1 is disordered as a result of its 
engineered Hisg insertion, and PL2 is oriented away from the channel 
(Fig. 1a), suggesting an open conformation for unimpeded periplasmic 
entry of urea into the periplasmic vestibule. Urea enters through this 
irregularly shaped vestibule, which begins roughly at the height of the 
bilayer edge, defined by a set of largely hydrophobic side chains includ- 
ing Leu 2, Tyr 76, Trp 142 and Trp 146 (Fig. 2b). Next on its way to the 
cytoplasm, urea must pass through two constrictions, one on each side 
of Glu 177. Constriction site 1 on the periplasmic side of Glu 177 is 
defined by the side chains of conserved Leu6, Phe 84 and Trp 149 
(Figs 2b and 4b). Just beyond Glu 177 is constriction site 2, defined 
by the side chains of conserved Leu 13, Thr 87, Tyr 88, Leu 152 and 
Trp 153 (Figs 2b and 4b). Trp 153 has its aromatic plane oriented 
perpendicular to the channel axis, restricting the passage of solutes, 
and is therefore a candidate for a functional mutation (Supplementary 
Fig. 8). Its indole side chain can reorient about 72 because of minimal 
steric restraints, enabling the passage of urea and other amides. Similar 
flexibility of other aromatic side chains in the channel would allow 
transient n-7 stacking of their side-chain surfaces or NH,-7 electro- 
static interaction with planar urea to permit selective passage, while at 
the same time preventing acidification of the cytoplasm by blocking 
the passage of protons or hydronium. The constrictions in the middle 
third of the protomer are probably the selectivity filter, which discrimi- 
nates tenfold between urea and thiourea in oocyte studies’. On the cyto- 
plasmic side of constriction site 2, the channel widens again, with Tyr 104 
and the hydrogen-bonded pair of Asn 16 and Asn 33 at the beginning ofa 
funnel-shaped vestibule, allowing the exit of urea into the cytoplasm. 
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constriction sites 1 and 2 viewed from the periplasm. c, Channel activity of 
mutants: mutating Trp 153 to Ala or Phe (constriction site 2) changes selectivity 
from 4.2:1 (urea:thiourea) for the wild type to 1.5:1 and 1.2:1 for the Ala or Phe 
mutants, respectively. Mutations at Tyr 88 and Trp 149 inactivate or severely 
impair the channel, whereas mutation Phe 84—Leu retains discrimination 

(n = 10-20; error bars indicate s.e.m.). d, Western analysis showing nearly 
equivalent expression for the different mutants. 
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Previously, mutations of protonatable residues in the periplasmic 
domain identified residues important for proton sensing or gating’. 
Here we tested the effect of mutations Phe 84—Leu and Trp 149—Phe 
in the periplasmic constriction site 1 (Fig. 4b) and Tyr 88—Phe and 
Trp 153—Ala/Phe in the cytoplasmic constriction site 2 (Fig. 4b) on 
the discrimination between urea and thiourea (Fig. 4c), a property 
common to HpUrel, DvUT and UT-B***"*. The rate of uptake of urea 
or thiourea was measured in Xenopus oocytes. Western analysis con- 
firmed equal expression levels of mutant and wild-type proteins 
(Fig. 4d). Phe 84—Leu retained the discrimination but with decreased 
uptake. In the cytoplasmic constriction site, Trp 153—Ala and 
Trp 153—>Phe retained urea transport but lost much of the discrim- 
ination between urea and thiourea (Fig. 4c). The selectivity for urea 
over thiourea is therefore largely determined by Trp 153 in the cyto- 
plasmic constriction site 2 of the channel. The buffering capacity gen- 
erated by proton activation of HpUrel coupled with urea hydrolysis by 
the very high levels of urease was estimated on the basis of purified 
HpUrel (Supplementary Information)’’. 

Channel opening also correlates with the acid-dependent recruit- 
ment of urease to the cytoplasmic side of the membrane, dependent on 
HpUrel and essential for survival at pH 3.0 (ref. 18). About two-thirds 
of the urease is recruited to the plasma membrane on the basis of post- 
sectioning immunoelectron microscopy at pH 5.0 (ref. 19), confirmed 
by SDS gel analysis of the membrane fraction of H. pylori, which also 
demonstrates activation of the membrane-recruited urease”’. Measure- 
ment of cytoplasmic pH changes in wild-type and ure! deletion mutants 
also showed that HpUrel increases the bilayer permeability of CO, 
and NH;/NH,", providing a physiological role for this association 
by allowing rapid backflow of these molecules to the periplasm’’, thus 
increasing the rate of periplasmic buffering. These data indicate that a 
conformational change in the membrane domain of HpUrel may occur 
with acidification that is transmitted to its cytoplasmic surface. 

In contrast with other neutral-solute membrane channels, such as 
aquaporins” and mammalian urea transporters’*, which are con- 
structed of two homologous antiparallel halves that generate a long, 
narrow channel, the parallel three-fold pseudo-symmetric architecture 
of HpUrel defines a previously unobserved shorter hourglass-shaped 
channel with a central urea filter and no inverted symmetry. Thus, 
although both HpUrel and mammalian urea transporters homologous 
to DvUT selectively allow the passage of urea, the three-dimensional 
structures and mechanisms of selectivity are distinct. 

This first three-dimensional structure of a channel from the AmiS/ 
Urel superfamily reveals a previously unobserved fold with a unique 
channel architecture able to filter polar organic solutes selectively. As 
the structure of a validated target for H. pylori eradication it may guide 
the discovery of small-molecule inhibitors, providing the possibility of 
monotherapy without the use of conventional antibiotics. 


METHODS SUMMARY 

Crystallization and structure determination. HpUrel was expressed in Escheri- 
chia coli cells, then purified and crystallized in a mixture of octylglucoside and 
decylmaltoside, supplemented with E. coli polar lipids (Avanti). The structure was 
solved using SeMet MAD phasing, followed by threefold non-crystallographic 
averaging, solvent flattening, phase extension, model building and refinement. 
Functional assays. Blue native gel electrophoresis indicated a native hexameric 
structure for HpUrel. The relative rates of transport of urea and thiourea were mea- 
sured as uptake of tracer in CRNA-injected Xenopus oocytes with various mutations. 


Full Methods and any associated references are available in the online version of 
the paper. 
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Engineering a His, tag into HpUrel. DNA encoding HpUrel was isolated by 
PCR from Helicobacter pylori strain J99. A His. tag was introduced into the protein 
at various locations to facilitate purification. The engineered proteins with a His, 
tag at the N terminus, in the first periplasmic loop (PL1), the second periplasmic 
loop (PL2) or at the C terminus were expressed in Xenopus oocytes and tested for 
channel activity. Wild-type HpUrel showed urea uptake of 29.04 + 1.35 pmol per 
oocyte per 10min (mean + s.e.m.; 1 =9) and the PL1 His, tag (after Gly 61) 
showed an uptake of 22.36 + 0.60 pmol per oocyte per 10 min (mean ~ s.e.m,; 
n= 8). Mutants with the His, insertion at the N terminus, in PL2 or at the C ter- 
minus, were inactive. HpUrel with the His, tag inserted in PL1 (HpUreI6HisPL1) was 
subsequently used for expression, purification and crystallization. 

HpUrel expression and membrane isolation. Small-scale expression. pET101- 
UreI6HisPL1 was transformed into E. coli C43 (Avidis S.A.). For small-scale 
expression and crude membrane isolation, bacterial cultures were grown to 
Doo = 0.8 and then induced by the addition of 1 mM isopropyl B-p-thiogalactoside 
(IPTG). After 3h of induction, cells were harvested by centrifugation at 1,500g 
for 10 min. The pellet was resuspended in a solution of 50 mM Na,HPO, pH 7.4, 
1mM EDTA, 30pgml~’ DNaseI and sonicated to lyse the cells. Cell debris 
was removed by centrifugation (10,000g, 10 min) and membranes were collected 
(100,000g, 45 min) and resuspended in the same buffer without DNase I (40-50 ul). 
Samples of membrane protein (25 1g, determined by bicinchoninic acid assay 
(Pierce Biotech, Inc.)) were dissolved in gel loading buffer containing 1% 2- 
mercaptoethanol and run without boiling on 4-12% SDS-polyacrylamide gels. 
After transfer to nitrocellulose, western blot analysis was performed with either 
an anti-Urel or an anti-His-tag antibody (GE). 

Large-scale membrane expression for crystallization trials. A bioreactor (BioFlo 
110; New Brunswick) containing 101 Luria-Bertani broth supplemented with 
50mM K,HPO, pH7.8 and 1.5% (w/v) glycerol was inoculated with 0.21 of 
overnight culture of the E. coli harbouring a pET101HpUreI6HisPL1 plasmid. 
When Dgoo reached 0.8-1.0 (about 2h), HpUrel expression was induced with 
1mM IPTG. Growth was maintained at 37°C, with a continuous air supply at 
51min ' and stirring at 500 r.p.m. for 5h until the cell density reached a Deoo of 
5-7. Cells were then collected by centrifugation and resuspended at 4 °C in 400 ml 
of buffer containing 50mM Na,HPO, pH7.4, 30ugml ' DNase and 10mM 
2-mercaptoethanol. All subsequent steps were conducted at 4 °C. The bacterial 
suspension was passed twice through a French press cell at 20,000 lb in’ *. The cell 
debris was removed (10,000g, 20 min) and the remaining membranes were col- 
lected (100,000g, 2 h) to yield about 4 g of total membrane protein. The membrane 
pellet was resuspended in 350 ml of storage buffer (10 mM imidazole, 150 mM 
NaCl, 50 mM Na,HPO, pH 7.4, 65 g1 | glycerol and 2mM mercaptoethanol) to 
yield 10-12 mgml_' membrane protein, which was then flash-frozen in liquid 
nitrogen and stored at —80 °C. 

Labelling with selenomethionine. The pET101Urel6HisPL1 construct was trans- 

formed into the methionine auxotroph strain BL21-CodonPlus-RP-X strain 
(Stratagene) for optimal labelling of HpUrel with selenomethionine in minimal 
medium. A 10-1 culture of M9 minimal medium (60g1~’ Na,HPO,, 30g1"’ 
KHPO,, 10gl' NH,Cl, 5gl-* NaCl, 0.4% (w/v) glucose, 0.1mM CaCh, 
2mM MgSO,, 0.11 MEM vitamin solution (Mediatech)) supplemented with 60 pg 
ml * selenomethionine, was inoculated with 200 ml of overnight culture grown in 
the same minimal medium with 0.1mgml~' methionine in place of seleno- 
methionine. Protein expression was induced with 1mM IPTG at Dgoo ~ 0.8, 
and growth was continued for 40h at 20°C. 
HpUrel purification. Solubilization and metal affinity chromatography. 
Membranes in storage buffer were collected at 100,000g for 90 min, resuspended 
at 10mg ml! in a buffer of 10mM imidazole, 150 mM NaCl, 50mM sodium 
phosphate pH 7.4, and partly dissolved in 2% decylmaltoside (DM; Anatrace) by 
adding a 20% detergent solution drop by drop and stirring on ice for 30 min. The 
insoluble material was pelleted (160,000g, 30 min) and the supernate was gently 
agitated overnight with 5 ml of cobalt-agarose resin (TALON resin; Clontech) at 
4 °C to bind the Hisg-Urel. All the solutions used in subsequent rinsing and elution 
of the resin contained 150 mM NaCl, 50 mM sodium phosphate pH 7.4, 0.2% DM 
and 0.1mgml’ E. coli polar lipids (Avanti) with increasing concentrations of 
imidazole. The resin was rinsed with ten volumes of buffer with 10 mM imidazole, 
then 30 volumes with 20 mM imidazole. HpUrel was then eluted with 30 ml of the 
same buffer with 250mM imidazole. The eluted HpUrel was concentrated to 
about 10 mg ml ' by using Amicon 50-kDa filters before gel filtration. 

Size exclusion. HpUrel was purified on a Superose 12 column (10/300; GE) with 
buffer containing 10 mM MES pH 6.5, 150mM NaCl, 0.2% DM and 0.1 mg ml * 
E. coli polar lipid extract. The peak fractions were pooled and concentrated to 
10mg ml ! (50-kDa filters; Amicon) for use in crystallization trials. 

HpUrel crystallization. HpUrel protein at 10mg ml! in Superose 12 column 
buffer was diluted to give a solution comprising 1.57mgml_' HpUrel protein, 
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40mM NaCl, 1mM TICI, 10mM CaCh, 7% PEG 400, 0.05% decylmaltoside, 
2.25% octylglucoside, 0.8 mg ml ' E. coli polar lipids (Avanti) and 35 mM MES 
pH5.3. This mixture (3.5 pl) was used for hanging-drop diffusion over a reservoir 
(0.5 ml) of 20% PEG 400 in 0.1 M MES pH5.3. Crystals grew in 3-4 months at 
11°C and were dehydrated by raising the PEG 400 concentration in the reservoir 
in increments of 3% at 2-day intervals until the final PEG 400 concentration in the 
well solution was 33%. The crystal used for collection of the native data set is 
shown in Supplementary Fig. 1. 

X-ray data collection and data reduction. Single crystals were mounted in nylon 
loops and flash-cooled in liquid nitrogen. X-ray diffraction data collection was 
conducted at 100K by collecting 180 diffraction images 1° in width. Data were 
integrated, scaled and merged with the program XDS” (Supplementary Table 1). 
Because of the large variability of the c-axis length (135-152 A), data could not 
be merged between multiple crystals, making experimental phasing more chal- 
lenging. After diffraction data from hundreds of heavy-metal-soaked crystals had 
been collected without yielding interpretable maps, we resorted to selenomethio- 
nine (SeMet) phasing, despite the low abundance of methionine in HpUrel J99 
(three residues out of 195, counting the N-terminal methionine). Although fluore- 
scence scans of the initially very small SeMet crystals indicated significant incorpo- 
ration of selenium, selenium sites could not be located in more than 50 MAD and 
single-wavelength anomalous dispersion data sets collected with 30° wedges. 

The successful MAD data set was collected from a rare larger crystal that took 
more than 5 months to grow. Three-wavelength MAD diffraction data were 
collected in 10° wedges at beamline 12-2 at SSRL (Supplementary Table 1). 
SeMet MAD phasing. The program SHELXC” was used to determine the appro- 
ximate resolution cutoff for the anomalous signal (when the correlation coefficient 
between the anomalous differences at the peak and remote wavelengths decreased 
to below 30%). SeMet sites were initially obtained using the program SHELXD™. 
Of the nine possible sites (three protomers in the asymmetric unit with three 
SeMet sites per protomer, including all three N-terminal SeMet residues), eight 
sites were located. From the symmetry of these eight sites, a ninth site could be 
located, leading to the localization of all selenium anomalous scatterers. The 
boundary of the HpUrel hexamer and individual helices were readily apparent 
in the resulting low-resolution maps; however, unsurprisingly, detail of the maps 
and phasing power was poor. To improve the phases, the program autoSsHARP” 
was used. The figure of merit from autoSHARP phasing was used to determine the 
high-resolution cutoff for the experimental phases. A cutoff of 0.3 for the figure of 
merit of acentric reflections was used for this estimation, suggesting useful phases 
to about 4.7 A resolution (Supplementary Table 2). 

Phase improvement and extension. To improve and extend the initial experi- 
mental MAD phases, solvent flattening, three-fold non-crystallographic symmetry 
(NCS) and multi-crystal averaging of data from crystals with different c-axis 
lengths were performed. To obtain initial matrices for NCS averaging, 18 ideal 
helices were modelled into the experimental electron density representing the 
three molecules in the asymmetric unit. Subsequently the NCS matrices from 
molecule A to molecule B and from molecule A to molecule C were obtained by 
the SSM superpose function of the program COOT”*. A mask was placed around 
the electron density of molecule A and threefold NCS averaging and solvent flat- 
tening (68.9% solvent) were performed with the program DM”, while NCS mat- 
rices were refined. Phases were extended in an iterative fashion from 4.5 A to 3.5 A 
by using small resolution increments with the programs DM and DMMulti’’”. At 
this point the right-handed twist of the helices and density for some of the larger 
side chains could be seen. 

Model building and refinement. Density representing each of the six transmem- 
brane helices was cut out with the program PHENIX”. An ideal helix was built 
into this density, slightly curved if required, using the sequence representing each 
helix. Helix 6 contains a -bulge and was built by hand with the program COOT. 
Cytoplasmic loops 1 and 2 (CL1 and CL2) and periplasmic loop 2 (PL2) were 
initially modelled into density with the program RAPPER”. In places where the 
correct sequence could not be built initially, alanine was used temporarily. To 
generate the other two molecules of the asymmetric unit, the previously identified 
NCS matrices were used, followed by rigid-body refinement with the program 
PHENIX. The model was refined iteratively by cycles of manual adjustments with 
the program COOT and refinement with the program PHENIX, using TLS refine- 
ment with NCS and secondary structure restraints. 

On analysing the data with the University of California Los Angeles anisotropy 
server”, the data were found to be severely anisotropic with an effective resolution 
of 3.1 A in the a* and b* directions, but only 3.5 A in the c* direction. The data 
were ellipsoidally truncated and rescaled to minimize the inclusion of poor dif- 
fraction data. The model was refined with the newly truncated data by using jelly 
body refinement with the program REFMAC”™, leading to significantly improved 
electron density maps that allowed further model improvement. Because of 
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disorder, no model was built for the majority of periplasmic loop 1 (PL1, residues 
59-73), which contains the engineered Hisg insertion. 

A final round of refinement was performed with the program REFMAC against 
the non-truncated data with two TLS groups in each protomer (residues 1-146 
and 147-195), tight NCS restraints and a jelly body value of 0.01. The final 
structure was refined to 3.26 A with few Ramachandran outliers, small deviations 
from ideal geometry and predominantly preferred side-chain rotamers (Sup- 
plementary Table 1). 

SeMet anomalous difference maps. To validate the sequence assignment of the 
HpUrel crystal structure, additional methionine residues were engineered into 
various sites in the HpUrel sequence and labelled with SeMet. These proteins were 
crystallized and their anomalous differences Fourier maps were inspected. For all 
five engineered sites, the location of the anomalous difference peak was less than 
1 A from the site of the methionine sulphur atom of the final model. The sites were 
Tle 14, Ala 148, Thr 155, Leu 173 and Ile 191, in addition to the endogenous sites 
Met 1, Met 14 and Met 127, the last of these being situated in PL2. 

Estimation of the number of HpUrel molecules per bacterial cell by western 
blotting. Bacterial proteins were size-fractionated by SDS-tricine polyacrylamide 
gel electrophoresis and electroblotted onto a nitrocellulose membrane, followed 
by immunodetection by enhanced chemiluminescence (Amersham). The anti- 
HpUrel antibody used for detection and quantification was generated in rabbit 
against PL1 between TMH2 and TMH3 (CEGAEDIAQVSHHLTSFYGPATG)”. 
Immunoblots were digitized (ScanMaker i800; Microtek) at a resolution of 
600 d.p.i. The scanned images were analysed with Kodak 1D software and the 
amount of HpUrel was quantified with purified HpUrel as the reference. There are 
5.3ng of HpUrel per tg total protein (Supplementary Fig. 2) in 5.4 X 10° cells 
(measured by colony-forming units). The molecular mass of HpUrel is 21.7 kDa. 
There are therefore 0.24pmol of HpUrel per 5.4 X 10° cells; 0.24 pmol of 
HpUrel X 6.0 X 107°/(5.4 X 10° cells) is equivalent to about 27,000 channels 
per cell. 

Urease activity assay. Bacteria grown on trypticase soy agar (TSA) plates were 
suspended in 1 ml of 1 mM phosphate buffer to a final Doo of 0.01. Urease activity 
was measured radiometrically. Bacteria were added to 100 mM sodium phosphate 
buffer containing 5 mM KCl, 138 mM NaCl, 0.5 mM MgCl, 1 mM CaCl,, 10 mM 
glucose and 5mM ["*C]urea with a specific radioactivity of 10 1Cipmol ‘. To 
measure activity in the closed and open states, the pH of the buffer was set to 
pH 4.5 and 7.4 by mixing various amounts of 100 mM monobasic sodium phos- 
phate and 100 mM dibasic sodium phosphate to the desired pH. The pH of the 
buffer during the course of the experiment did not change by more than 0.1 pH 
units. Plastic wells containing 500 mM KOH-soaked filter paper hung from rubber 
stoppers were used to collect the total '*CO, from the hydrolysis of urea. Urease 
activity was measured for 30 min at 37 °C with constant agitation. The reaction 
was terminated with 2.5M H)SO, to release all '*C as CO, and incubated for 
a further 30min at 37°C. The wells were placed in scintillation cocktail 
(HilonicFluor; Packard Instruments), and the radioactivity was measured by scin- 
tillation counting (1216 RackBeta; LKB Instruments). Protein concentration was 
determined by the bicinchoninic acid method (Pierce). Urease activity was about 
2.3 umolmin ' mg’, sufficient to hydrolyse all entering urea®. 

Urea flux estimation. To estimate the urea flux per channel per second, the total 
urease activity of H. pylori cells was measured under HpUrel-mediated flux con- 
ditions. On the assumption that all urea entering the cell is hydrolysed by the large 
amount of cytoplasmic urease (about 8-10% of total protein) and knowing the 
number of HpUrel molecules per cell (27,000; see above), the lower bound of the 
flux was estimated to be about 200 urea molecules per HpUrel channel per second 
at 5 mM medium urea, calculated from the rate of urea hydrolysis and the number 
of channels per organism. 

This is a tenfold higher influx of urea than that due to unfacilitated diffusion 

across phospholipid bilayers (4 X 10-°s”') (ref. 33) and it agrees with the tenfold 
increase in urease activity observed when HpUrel is open. This calculation does 
not take into account the roughly fourfold increase in urease activity when incu- 
bated at acidic pH (ref. 6); hence the rate could be as low as 1,000 st. 
NH; production due to urea influx through HpUrel. Using the calculated flux 
rate and the number of channels per cell from above it was estimated that at least 
5.4 X 10° urea molecules are transported through HpUrel per cell per second at 
gastric urea concentrations. At pH 5.0, with HpUrel fully open, this can generate 
about 30 mM NH; per second neutralizing capacity in a bacterium with a volume 
of 0.6 fl. 


Blue native gel electrophoresis of HpUrel. E. coli membranes were treated with 
2% n-dodecyl-f-b-maltopyranoside (DDM) and the soluble fraction was bound to 
TALON resin. After washes with buffer containing 20 mM imidazole, HpUrel was 
eluted with a buffer containing 150mM imidazole, 50 mM sodium phosphate 
pH7.4, 150mM NaCl and 0.01% DDM. The eluate was concentrated on 
Amicon filters with 10-kDa cutoff to about 5 mg ml |. Protein sample (5 ul) 
was mixed with 2 X sample buffer (100 mM Tricine, 30mM Bis-Tris pH7.0, 
30% glycerol, 0.02% DDM). The cathode buffer was 50mM Tricine, 15 mM 
Bis-Tris pH 7.0, and the anode buffer was 50 mM Bis-Tris pH 7.0. The electro- 
phoretic shift of proteins was achieved by including Coomassie blue G250 dye in 
the cathode buffer. The running conditions were as follows: initial concentration 
of Coomassie blue G250 was 0.02% in the cathode buffer and the gel was run for 
30 min at 100 V and for a further 2 h at 200 V. The concentration of dye was then 
decreased to 0.002% and the gel was run for a further 2 h at 300 V. High-molecular- 
mass markers (catalogue number 17-0445-01; Amersham), BSA and trypsin inhi- 
bitor (Sigma) were mixed with the same sample buffer (Supplementary Fig. 3). 
Solute uptake experiments in Xenopus laevis oocytes. Genes encoding wild- 
type HpUrel (H. pylori strain 43504) and His,-tagged HpUrel (H. pylori strain J99 
with the His, tag in PL1) were cloned into the pcDNA3.1 (Invitrogen) and pET101 
(Invitrogen) vectors, respectively. For the experiments on site-specific mutants, 
substitutions were introduced into the gene encoding His,-tagged HpUrel with the 
use of the QuickChange method (Stratagene). Capped and poly(A)-tailed RNA 
(cRNA) was prepared with the mMessage mMachine T7 Ultra Kit (Ambion). 
cRNA (50nl, lpg ult) was injected into each oocyte (Ecocyte BioScience). 
Oocytes were incubated for 2 days at 18°C in Barth’s solution (88 mM NaCl, 
24mM NaHCO;3, 1mM KCl, 0.4mM CaCl, 0.3mM Ca(NO3),., 0.8mM 
MgSOu,, 10 mM HEPES-Tris pH 7.5). Oocytes were transferred to new vials con- 
taining 0.5 ml of Ringer’s solution (100 mM NaCl, 2mM KCl, 1 mM CaCh, 1 mM 
MgCl,) buffered by 20mM MES at pH5.0 for comparison between urea and 
thiourea uptake through the open HpUrel channel. The transport reaction was 
started by the addition of 100 uM ["*C]urea or 100 1M ['4C] thiourea. After 10 min 
of incubation at 20°C the solution was aspirated to terminate transport and the 
oocytes were washed twice with ten volumes of ice-cold radioisotope-free Barth’s 
solution. Oocytes were transferred to individual scintillation vials and dissolved in 
10% SDS before the addition of scintillation cocktail. 

Western analysis of mutant expression levels. Ten oocytes were solubilized by 
pipetting into buffer containing 0.5 ml of 0.1 M NaCl, 0.02 M Tris-HCl pH 7.6 and 
1% Triton X-100. The yolk was spun down by centrifugation at 20,000g for 5 min 
and the supernatant was aspirated with care, to minimize contamination with 
floating lipid. Supernatant (20 il) was size-fractionated by SDS-tricine polyacry- 
lamide gel electrophoresis and electroblotted onto a nitrocellulose membrane 
followed by immunodetection with alkaline phosphatase (Promega). The anti- 
HpUrel antibody used for detection was generated in rabbit against PL1 between 
TMH2 and TMH3 (CEGAEDIAQVSHHLTSFYGPATG). All mutants were 
expressed at similar levels (Fig. 4d). 
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Drosophila NOMPC is a mechanotransduction 
channel subunit for gentle-touch sensation 


Zhiqiang Yan'*, Wei Zhang'*, Ye He', David Gorezyca', Yang Xiang’, Li E. Cheng’, Shan Meltzer’, Lily Yeh Jan' & Yuh Nung Jan! 


Touch sensation is essential for behaviours ranging from environ- 
mental exploration to social interaction; however, the underlying 
mechanisms are largely unknown’. In Drosophila larvae, two types 
of sensory neurons, class III and class IV dendritic arborization 
neurons, tile the body wall. The mechanotransduction channel 
PIEZO in class IV neurons is essential for sensing noxious mech- 
anical stimuli but is not involved in gentle touch’. On the basis of 
electrophysiological-recording, calcium-imaging and behavioural 
studies, here we report that class III dendritic arborization neurons 
are touch sensitive and contribute to gentle-touch sensation. We 
further identify NOMPC (No mechanoreceptor potential C), a 
member of the transient receptor potential (TRP) family of ion 
channels, as a mechanotransduction channel for gentle touch. 
NOMPC is highly expressed in class III neurons and is required 
for their mechanotransduction. Moreover, ectopic NOMPC expres- 
sion confers touch sensitivity to the normally touch-insensitive class 
IV neurons. In addition to the critical role of NOMPC in eliciting 
gentle-touch-mediated behavioural responses, expression of this 
protein in the Drosophila S2 cell line also gives rise to mechanosen- 
sitive channels in which ion selectivity can be altered by NOMPC 
mutation, indicating that NOMPC is a pore-forming subunit of a 
mechanotransduction channel. Our study establishes NOMPC as a 
bona fide mechanotransduction channel that satisfies all four cri- 
teria proposed for a channel to qualify as a transducer of mechanical 
stimuli* and mediates gentle-touch sensation. Our study also sug- 
gests that different mechanosensitive channels may be used to sense 
gentle touch versus noxious mechanical stimuli. 

Similar to vertebrates, Drosophila displays mechanosensation such 
as gravity sensing, hearing, proprioception, mechanical nociception 
and gentle-touch sensation* "’. To identify sensory neurons for gentle 
touch we used a previously established behavioural assay'’. In brief, 
one side of the larval thoracic segments was gently touched with an 
eyelash and the behavioural responses were scored (Supplementary 
Fig. 1 and Supplementary Methods). By expressing the tetanus toxin 
light chain (TNT) in different classes of dendritic arborization neurons 
to interfere with synaptic transmission, we found that the behavioural 
response to gentle touch was reduced by TNT expression in class III 
(Fig. la, b) but not class IV dendritic arborization neurons (Sup- 
plementary Fig. 2). In all of the behavioural studies we used a single 
copy of upstream activation sequence (UAS)-TNT and a single copy of 
the yeast Gal4 driver, which may not have been sufficient to completely 
eliminate the synaptic output of class III dendritic arborization neu- 
rons. Alternatively, the residual larval behavioural response to gentle 
touch may indicate the involvement of additional sensory neurons. 

As dendrites of five class III dendritic arborization neurons tile 
70-80% of each abdominal hemisegment’* (Supplementary Fig. 3), a 
Drosophila larva could use these neurons to detect gentle touch on 
most of its body. Extracellular recordings of class III neurons in abdo- 
minal segments of fillet preparations (Supplementary Fig. 4) revealed 
that a single touch causing a displacement of the body wall by as 
little as 10 m—about one-hundredth of the width of a third-instar 


larva—induced a burst of action potentials (Fig. 1c). Progressively 
larger displacements elicited greater numbers of action potentials 
in a graded fashion (Fig. 1d, i). In addition, a single touch causing a 
40-um displacement induced a large calcium response in class III 
neurons (Fig. le). Similar to the five class III dendritic arborization 
neurons in each abdominal hemisegment, class III neurons in thoracic 
segments showed touch responses (Supplementary Fig. 5). By contrast, 
the same touch stimulation had no effect on class IV dendritic arbor- 
ization neurons that are mechanical nociceptive neurons*'* (Sup- 
plementary Fig. 6). 

Class III neurons adapted quickly to a prolonged stimulus with a 
time constant of 153 ms, and showed an off response upon the removal 
of the stimulus in most cases (Fig. 1d and Supplementary Fig. 7), 
similar to touch-sensitive mechanoreceptors in Caenorhabditis elegans 
and mice!!*!”, Thus, class III dendritic arborization neurons are low- 
threshold mechanoreceptors for sensing gentle touch. 

To identify the mechanotransduction channel that mediates gentle 
touch, we examined the role of NOMPC, which is important for hearing 
and mechanotransduction in adult Drosophila and larval locomo- 
tion’*'**", Our immunostaining of Drosophila larvae confirmed high 
levels of NOMPC expression in the ciliate tips of type I sensory 
neurons'’®'*** (Supplementary Fig. 8b), and we found high levels of 
NOMPC throughout the soma and dendrites of class II dendritic 
arborization neurons of wild-type but not nompC null mutant larvae 
(Fig. 1f and Supplementary Fig. 8a, c). We further uncovered severe 
defects of nompC null mutant larvae in the behavioural response to 
gentle touch, which could be rescued by expressing NOMPC in class III 
neurons (Fig. 1g, h), indicating that NOMPC functions cell autono- 
mously in these neurons to mediate gentle-touch sensation. Moreover, 
class III neurons of the nompC mutant larvae failed to respond to 
touch stimuli with action potential firing (Fig. 1i) or calcium response 
(Fig. 1j, k) despite their normal morphology (Supplementary Fig. 9), a 
defect that could also be rescued by class III neuron expression of 
NOMPC (Fig. li-k). Two Drosophila TRP channels, Inactive (AV) 
and Nanchung (NAN), are interdependent in their contribution to 
the hearing of adult Drosophila*. We found normal mechanosensitivity 
of class III dendritic arborization neurons in iav null mutant larvae 
(Fig. 1i, k), which showed a normal behavioural response to gentle 
touch (Fig. 1g, h). Thus, NOMPC, but not IAV, is required for mechan- 
otransduction of larval class III dendritic arborization neurons that 
sense touch. 

We next tested whether touch sensitivity can be conferred to class 
IV dendritic arborization neurons—which normally do not express 
NOMPC (Fig. 2a and Supplementary Fig. 10)—by expressing NOMPC 
ectopically (Fig. 2b). Although class IV neurons with the ppk-Gal4 
driver alone were insensitive to touch, class IV neurons with 
NOMPC expression responded to the touch stimulation with a burst 
of action potentials (Fig. 2c—e) and an increase in internal calcium level 
(Fig. 2f-h). Thus, not only is NOMPC necessary for mechanosen- 
sitivity of class III neurons, but its ectopic expression is sufficient to 
enable the normally touch-insensitive class IV neurons to respond 
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Figure 1 | NOMPC is required for mechanotransduction of class III 
dendritic arborization neurons that function in gentle-touch sensation of 
larvae. a, Silencing class III neurons with TNT expression impaired larval 
behavioural responses to gentle touch (1 = 24-26). WT, wild type. b, Score 
distribution of different genotypes. 0, no response; 1, pause; 2, recoil; 3, single 
reverse contractile wave; 4, multiple reverse contractions. c, Touching the 
dendrite field of a class III dendritic arborization neuron to cause a 10-l1m 
displacement induced a burst of action potentials. d, Class III dendritic 
arborization neurons responded to prolonged mechanical stimulation, with an 
on response displaying rapid adaptation as well as an off response. e, Internal 
calcium increase in a class III dendritic arborization neuron in response to a 40- 
uum touch displacement lasting 300 ms. Arrowhead indicates stimulus onset. 
f, Immunostaining of NOMPC in ddaF and ddaA class III dendritic 
arborization neurons (labelled by class III-neuron-specific 19-12-Gal4-driven 
GFP). Arrowhead marks a dendritic segment with NOMPC expression in 
dendritic spikes, as shown at high magnification on the right. Scale bars: left, 50 
uum; right, 5 jm. g, nompC mutant was defective in gentle-touch sensation; this 
behavioural phenotype could be rescued by expressing NOMPC in class III 
dendritic arborization neurons (nompC rescue), but not by UAS-NOMPC 
(UAS Ctrl) or Gal4 (Gal4 Ctrl) alone. nompC mutant, Gal4 Ctrl and UAS Ctrl 
were compared to both 19-12-Gal4 and nompC rescue for the significance test. 
iav mutant showed a normal behavioural response to gentle touch compared 
with wild-type control (n = 20-24). h, Score distribution of different 
genotypes. i, Summary of action potential firing of class III dendritic 
arborization neurons in the nompC mutant, 19-12-Gal4, nompC rescue, and 
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Figure 2 | Ectopic NOMPC expression confers mechanosensitivity to the 
normally touch-insensitive class IV dendritic arborization neurons. a, Class 
IV neurons showed no detectable NOMPC expression, whereas class III 
neurons nearby displayed NOMPC staining (indicated by arrowheads). 
b, Class IV neurons with ectopic expression of NOMPC showed strong 
NOMPC staining. Scale bars in a and b, 50 tum. ¢, f, Class IV neurons did not 


show detectable electrophysiological (c) or calcium (f) response to mechanical 
stimuli. d, g, Class IV neurons with NOMPC expression displayed 
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iav mutant in response to increasing mechanical stimuli. y axis denotes 
increase in the number of action potentials (APs) in 1s after stimulus onset 
compared to 1s before stimulus onset. nompC mutant showed markedly 
reduced responses compared to 19-12-Gal4 and nompC rescue, whereas the iav 
mutant showed no difference from 19-12-Gal4 (P> 0.2; n = 7-10). j, The 
calcium response of class III dendritic arborization neurons in nompC mutant 
(green) and nompC rescue (red). Arrowhead indicates the onset of the stimulus 
(40-,1m displacement lasting for 300 ms). k, Group data of calcium response of 
class III dendritic arborization neurons with different genotypes (n = 7-10). 
We used unpaired t-test for comparison between two groups, and one-way 
analysis of variance followed by Tukey’s comparison for comparison among 
three or four groups. NS, not significant. **P < 0.01, ***P < 0.001. All error 
bars denote + s.e.m. Genotypes are as follows: for a, b, wild-type control: wills. 
TNT control: UAS-TNT/+. Gal4 control: repo-GAL80/+; 19-12-GAL4/+. 
Class II silenced: repo-GAL80/UAS-TNT; 19-12-GAL4/+. For g, h, 
19-12-Gal4: repo-GAL80/+; 19-12-GAL4/+, nompC mutant: nompC'/nompC’; 
+/+. Gal4 control: nompC'/nompC’; 19-12-GAL4/+. UAS control: nompC'/ 
nompC’, UAS-nompC; +/+.nompC rescue: nompC'/nompC’, UAS-nompC; 19- 
12-GAL4/+. Wild-type control: w'!8 jav mutant: iav’. Fori, 19-12-Gal4: 19-12- 
GAL4, UAS-CD8-GFP/+. nompC mutant: nompC'/nompC’ ; 19-12-GAL4, UAS- 
CD8-GFP/+. nompC rescue: nompC'/nompC’, UAS-nompC; 19-12-GAL4, 
UAS-CD8-GFP/+. iav mutant: iav'; +/+; 19-12-GAL4, UAS-CD8-GFP/+. For 
j,k, 19-12-Gal4: 19-12-GAL4/UAS-GCaMP5. nompC mutant: nompC'/nompC’; 
19-12-GAL4/UAS-GCaMP5. nompC rescue: nompC'/nompC’, UAS-nompC; 19- 
12-GAL4/UAS-GCaMP5. iav mutant: iav'|y; +/+; 19-12-GAL4/UAS-GCaMP5. 
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touch-induced action potential firing (d) and calcium rise upon 40-1m touch 
displacement lasting 300 ms (g). The same mechanical stimuli were used in 
f and g. Arrowheads indicate stimulus onset. e, h, Group data of 
electrophysiological (e) and calcium (h) response to mechanical stimulation of 
class IV neurons from control larvae and larvae with NOMPC expression. 
ppk>NOMPC denotes the UAS-NOMPC/+; ppk-Gal4/+ genotype. All error 
bars denote + s.e.m. n = 8-10. **P < 0.01, ***P < 0.001, unpaired t-test. 
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to touch. Overexpression of NOMPC also conferred touch sensitivity 
to the normally touch-insensitive class I dendritic arborization neu- 
rons (Supplementary Fig. 11). 

Next, we tested whether NOMPC could form mechanosensitive 
channels in a heterologous expression system. Although there is strong 
evidence supporting the notion that TRP-4, the C. elegans homologue 
of Drosophila NOMPCG, is a pore-forming subunit of a native mechan- 
otransduction channel”’, it remains to be tested whether heterologous 
expression of these channels gives rise to mechanosensitive channels”. 
We found that Drosophila S2 cells transfected with NOMPC displayed 
spontaneous channel openings in whole-cell recordings (Fig. 3a and 
Supplementary Fig. 12). This current could be blocked reversibly by 
gadolinium (Gd*") and SKF-96365 (Fig. 31 and Supplementary Fig. 
13a, b), which are known to block many TRP channels. The current 
could also be blocked by FM1-43, a blocker for mechanotransduction 
channels on hair cells** (Fig. 3n). 
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We then tested whether NOMPC channels could be gated by mech- 
anical force. First, by using a glass probe driven by a piezo actuator to 
deliver mechanical stimuli to the cell membrane during whole-cell 
recording, we observed both on and off responses (Fig. 3b). As expected 
for mechanically gated channels with fast activation (Fig. 3b), with 
increasing stimulation strength the current amplitude increased pro- 
gressively (Fig. 3c) whereas the latency decreased to less than 1.5 ms 
(Fig. 3d). Gd?* and EM1-43 blocked these mechanosensitive channels, 
and the channel activity recovered after washout (Fig. 3m, 0). 

Next, we applied negative pressure through the recording pipette 
with a high-speed pressure clamp to stretch the membrane of an 
outside-out excised patch, and found that the NOMPC channel was 
activated within 2 ms of the pressure application (Fig. 3e). As the 
pressure was gradually increased, the current amplitude increased 
progressively (Fig. 3f-h). The pressure-induced NOMPC current 
was also sensitive to Gd°* and FM1-43 (Supplementary Fig. 13c, d). 
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Figure 3 | Heterologous expression of NOMPC in S2 cells yields 
mechanosensitive channels. a, Representative traces of spontaneous channel 
activities in control and NOMPC-GFP-expressing S2 cells at —60 mV. 

b, Mechanosensitive current triggered by the piezo-actuator-driven probe 
causing a 3.2-1m displacement of the cell membrane. Insets, the latency is less 
than 2 ms (arrows indicate the onset and the end of the stimulation for the on 
and off response, respectively). c, The current amplitude (at —60 mV) varied 
with the size of the displacement (5 trials with 0.4-j1m increment, from 0 to 
1.6 pm). d, The latency of the current response as a function of the mechanical 
displacement (n = 7). Error bars denote + s.e.m. e, Mechanosensitive current 
of an outside-out patch in response to negative pressure applied with a high- 
speed pressure clamp. Arrow indicates the pressure onset. f, The 
mechanosensitive current amplitude varied with the pressure intensity (at 
—60 mV). g, Surface plot showing the current amplitude on a single patch 
increased progressively with increasing negative pressure at —60 mV. h, Group 
data of the dose-dependent curve of pressure-induced current (n = 7). Error 
bars denote + s.e.m. i, Single-channel current (at —60 mV) from 
spontaneously active channels (top trace) and channels activated by 
mechanical stimulation via probe displacement (middle trace) and negative 


pressure on excised patch (bottom trace). j, Histogram showing the increase of 
channel open probability after stimulation. Black denotes displacement- 
triggered channel activity and grey denotes spontaneous channel activity. c, 
closed; 0, open. k, Comparison of the single-channel current amplitude of 
spontaneously active channels and pressure-induced channels on a patch from 
the same cell at —60 mV. mech. curr., mechanicosensitive current; spon. curr., 
spontaneous current. n = 7; paired t-test; error bars denote + s.e.m. 

1, Spontaneously active channels in S2 cells expressing NOMPC-GFP were 
sensitive to 100 LUM Gd** and the block was reversible upon washout (n = 6). 
NP, indicates open probability. m, The displacement-induced 
mechanosensitive current was blocked by 100 1M Gd** and then recovered 
after washout (black, control; grey, Gd?*; red, washout; n = 4). 

n, Spontaneously active channels in S2 cells expres- 

sing NOMPC-GFP were sensitive to 3 1M FM1-43 and the block was reversible 
upon washout ( = 6). 0, The displacement-induced mechanosensitive current 
was blocked by 3 uM FM1-43 and then recovered after washout. Black, control; 
grey, FM1-43; red, washout; n = 5. ***P < 0.001, paired t-test. All error bars 
denote = s.e.m. 
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By resolving the single-channel current amplitude for mechano- 
sensitive channels (Fig. 3i, j), we found that the single-channel current 
amplitude for channels activated by negative pressure is identical 
to that for the spontaneously active channels (Fig. 3i, k). Reversal 
potential measurements under bi-ionic conditions further revealed 
that NOMPC channels are permeable not only to monovalent cations 
such as Na*, K~ and Cs”, but also the divalent cation Ca”* (Sup- 
plementary Fig. 14). 

A previous study has shown that mutations in several residues of the 
selective filter of the TRP-4 channel in C. elegans change the ion 
selectivity of the channel”. On the basis of the sequence homology 
between NOMPC and TRP-4 channels, we mutated acidic residues 
near the putative pore region of NOMPC”*”® (Fig. 4a). Whereas 
alanine substitution of Asp 1516 had no detectable effects on channel 
properties including the single-channel conductance (Fig. 4b), repla- 
cing Glu 1511 with the basic residue lysine considerably reduced chan- 
nel activity (Fig. 4c). Alanine substitution of Glu 1511 resulted in a 
reduction of single-channel conductance measured at —60mV in 
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Figure 4 | NOMPC is likely a pore-forming subunit of the 
mechanotransduction channel. a, Sequence alignment of the putative pore 
region of NOMPC and its homologues, putative pore helix (black bar) and the 
putative selective filter (grey bar). Arrows mark the negatively charged 

Glu 1511 and Asp 1516 residues targeted for mutagenesis. A. suum, Ascaris 
suum, X. tropicalis, Xenopus tropicalis; D. rerio, Danio rerio. b, The E1511A 
mutation altered the single-channel conductance of the spontaneous current 
determined at —60 mV from whole-cell recording in extracellular Na“/ 
intracellular Cs* solution (n = 7, 6, 14 from left to right). ***P < 0.001, 
unpaired t-test. c, The E1511A mutation altered the single-channel J-V curve of 
spontaneously active channels in the same Na“ /Cs* solutions. The E1511A 
mutant channel displayed outward rectification and the E1511K mutation 
greatly reduced channel activity. d, Sample traces of wild-type and mutant 
channels in response to 2.8-,1m displacement at — 100 (dark grey), 0 (black) and 
100 (grey) mV. e, The E1511A mutation altered the I-V curve of displacement 
(2.8 m)-triggered currents in the same Na*/Cs* solutions whereas the 
E1511K mutation largely eliminated the mechanosensitive current at negative 
voltage. f, The E1511A mutation altered the I-V curve of pressure-elicited 
currents in the same Na‘ /Cs" solutions whereas the E1511K mutation largely 
eliminated the mechanosensitive current (50 mm Hg) at negative voltage. 

g, The E1511A mutation shifted the reversal potential of spontaneously active 
NOMPC channels (black bars) and NOMPC channels activated by negative 
pressure (50 mm Hg, grey bars) in the bi-ionic Na‘/Cs* conditions (n = 6, 6, 6, 
4 from left to right). All error bars denote + s.e.m. 
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the shape of current-voltage relationship (I-V) curves (Fig. 4c and 
Supplementary Fig. 16). These mutant NOMPC channels showed 
similar distributions on the cell membrane as wild-type NOMPC 
(Supplementary Fig. 17). Notably, these mutations had similar effects 
on the spontaneously active channels and the channels activated by 
mechanical stimulation. The current response to piezo displacement at 
—100 mV was reduced in the E1511A mutant, whereas the response 
was largely eliminated in the E1511K mutant (Fig. 4d, e). The muta- 
tions also altered the shape of the I-V curves for the current induced by 
both mechanical displacement (Fig. 4e) and negative pressure (Fig. 4f) 
and shifted the reversal potential of the mechanosensitive channels in 
Na*/Cs™ solutions (Fig. 4g). The wild-type NOMPC channels exhi- 
bited a slightly higher permeability (P) to Na* than Cs* (Pya+/ 
Po,+ = 1.2), whereas the converse was true for the E1511A mutant 
(Pna+/Pcs+ = 0.7). These results support the notion that NOMPC is 
a pore-forming subunit of the mechanotransduction channel. 

For a channel to be considered a bona fide transducer of mechanical 
stimuli, it has to meet four criteria*””. NOMPC satisfies all of these 
criteria: first, it is expressed in class III dendritic arborization neurons 
that respond to touch; second, it is required for touch-induced excita- 
tion of class III neurons; third, ectopic expression of NOMPC endows 
the normally touch-insensitive dendritic arborization neurons with 
sensitivity to mechanical stimuli; and fourth, heterologous expression 
of NOMPC in S82 cells generates non-selective cation channels. 

We show that Drosophila NOMPC is most likely a pore-forming 
subunit of a channel that can be activated by mechanical force in a 
heterologous expression system and is required for mechanotransduc- 
tion of gentle touch in vivo. Together with Drosophila PIEZO, which 
is gated by mechanical force in a heterologous expression system and 
is required for sensing noxious mechanical stimuli in vivo*?*, these 
studies reveal that the class III and class IV dendritic arborization 
neurons that tile the larval body wall use different molecular mechan- 
isms to sense gentle touch and noxious mechanical stimuli, respec- 
tively, raising the question of whether different groups of sensory 
neurons in the skin of other organisms including mammals might 
also use different molecular mechanisms for sensing these different 
stimuli types. 


METHODS SUMMARY 

Behavioural assay. Animals were raised at 25 °C in an incubator. The thoracic 
segments of the larvae were touched with an eyelash and the behavioural responses 
were scored as described in ref. 12. 19-12-Gal4 is a marker for class III dendritic 
arborization neurons and is not expressed in class I, II or IV neurons””°. 
Electrophysiological recordings. Fillet preparations were made by dissecting 
third-instar larvae. Action potentials were recorded extracellularly. Whole-cell 
and outside-out patch recordings of S2 cells were carried out 1-2 days after 
transfection. 

Calcium imaging of dendritic arborization neurons. Calcium imaging of larval 
dendritic arborization neurons was carried with a newly available genetically 
coded calcium indicator, GCaMP5. 

Mutagenesis of NOMPC channel and 82 cell culture. All point mutations were 
introduced by site-directed mutagenesis and verified by sequencing of the full- 
length construct. Drosophila S2 cells were transfected using the product protocol. 
Mechanical stimulation. A glass probe was driven by a piezo actuator mounted 
on a micromanipulator to give mechanical stimulation. The movements were 
triggered and controlled by the piezo amplifier, which was synchronized with 
the programmed signals from pClamp software. Negative pressure on excised 
patch was delivered by high-speed pressure clamp. 

Immunohistochemistry. In brief, third-instar larvae were dissected in PBS, fixed 
and exposed to the primary antibody overnight at 4°C and then the secondary 
antibody for 2h at room temperature (25 °C). 
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Metabolic control of adult neural stem cell activity by 
Fasn-dependent lipogenesis 
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Mechanisms controlling the proliferative activity of neural stem 
and progenitor cells (NSPCs) have a pivotal role to ensure life-long 
neurogenesis in the mammalian brain’. How metabolic programs 
are coupled with NSPC activity remains unknown. Here we show 
that fatty acid synthase (Fasn), the key enzyme of de novo lipo- 
genesis’, is highly active in adult NSPCs and that conditional dele- 
tion of Fasn in mouse NSPCs impairs adult neurogenesis. The 
rate of de novo lipid synthesis and subsequent proliferation of 
NSPCs is regulated by Spot14, a gene previously implicated in lipid 
metabolism**, that we found to be selectively expressed in low 
proliferating adult NSPCs. Spotl4 reduces the availability of 
malonyl-CoA‘, which is an essential substrate for Fasn to fuel lipo- 
genesis. Thus, we identify here a functional coupling between the 
regulation of lipid metabolism and adult NSPC proliferation. 

Adult neurogenesis, the generation of new neurons throughout 
adulthood in discrete brain regions, is critically involved in certain 
forms of cognition and has been associated with neuropsychiatric 
diseases such as depression and epilepsy’’”. Recently, mechanisms 
regulating cell metabolism have been shown to control haematopoietic 
stem cell activity as well as proliferation of certain cancer cells*"’. 
Whether regulation of cell metabolism governs cell proliferation in 
mammalian NSPCs remains unknown. Therefore, we here studied 
the role of de novo lipogenesis and its key enzyme Fasn''”” in the 
context of adult NSPC biology. 

Fasn messenger RNA and protein were expressed in the two main 
neurogenic areas of the adult brain, the subgranular zone (SGZ) of the 
dentate gyrus and the subventricular zone (SVZ) lining the lateral 
ventricles (Fig. la and Supplementary Fig. la—h). Fasn expression 
was high in proliferating NSPCs but downregulated in vitro after dif- 
ferentiation (Supplementary Fig. 2a). Proliferating NSPCs showed 
high levels of Fasn enzymatic activity relative to other dividing 
neural cells, such as non-myelinating Schwann cells, or to NSPC- 
derived differentiated progeny (Supplementary Fig. 2b, c). High levels 
of Fasn activity were not due to impaired uptake of fatty acids, as 
NSPCs were competent to pick up free fatty acids in substantial 
amounts (Supplementary Fig. 2d). Mass-spectrometry-based metabo- 
lomics of dividing NSPCs, proliferating non-myelinating Schwann 
cells and differentiated progeny of NSPCs revealed striking differences 
in the metabolic profile of these discrete cell populations, confirming 
that NSPCs are in a distinct metabolic state (Fig. 1b). 

Pharmacological inhibition of Fasn activity using two independent 
inhibitors of Fasn (orlistat and cerulenin) reduced proliferation of 
rodent NSPCs in a dose-dependent manner (Fig. 1c and Supplemen- 
tary Fig. 3a, c, e) without inducing substantial amounts of cell death 
after short-term treatment (Supplementary Fig. 3b, d). In contrast, 
proliferation of non-myelinating Schwann cells was not affected by 
Fasn inhibition (Supplementary Fig. 3f). However, prolonged inhibition 


of Fasn enhanced apoptotic cell death, suggesting that Fasn activity 
is also required for NSPC maintenance (Supplementary Fig. 3d). 
To genetically delete Fasn, we isolated NSPCs from mice carrying 
floxed alleles of the Fasn gene (Fasn®*°%)!2_ Retrovirus-based Cre- 
recombinase-mediated excision of Fasn phenocopied the effects of 
pharmacological Fasn inhibition and reduced proliferation of NSPCs 
(Fig. 1d and Supplementary Fig. 4a-c). 

We next conditionally deleted Fasn specifically in adult NSPCs in 
vivo by crossing Fasn'*/"°* mice with mice harbouring tamoxifen- 
inducible nestin-driven Cre recombinase and yellow fluorescent pro- 
tein (YFP) reporter alleles in the ROSA locus (Fasn-cKO (conditional 
knockout))'*”*. When analysed 40 days after the last tamoxifen injec- 
tion, the number of recombined, YFP-expressing cells was drama- 
tically reduced in both neurogenic areas in Fasn-cKO mice 
compared to their respective controls (Fig. le, f and Supplementary 
Fig. 4d). Phenotyping of YFP-labelled cells in the dentate gyrus 
revealed that the defects observed after conditional Fasn deletion 
manifested themselves early in the neurogenic lineage with a reduction 
in YFP-labelled cells occurring already at the stage of radial NSPCs and 
resulting in a dramatic loss of newly generated neurons (Fig. 1g, h 
and Supplementary Fig. 5a, b). These results show that adult NSPCs 
require high levels of de novo lipogenesis for proper neurogenesis. 

We next sought to identify genes that could regulate de novo lipo- 
genesis in adult NSPCs and consequently alter proliferation of NSPCs. 
One gene that has previously been implicated as a context-dependent 
modulator of de novo lipogenesis is Spot14 (also known as thyroid 
hormone responsive protein (Thrsp))*>. In situ hybridization revealed 
that Spotl14 mRNA expression was highly confined to the two neuro- 
genic areas of the adult brain (Fig. 2a and Supplementary Fig. 6a—c, e). 
To study Spot14-expressing cells in more detail we used a bacterial 
artificial chromosome (BAC) transgenic mouse driving green fluore- 
scent protein (GFP) under the control of the regulatory elements of the 
Spot14 gene (Spot14-GFP). In agreement with the mRNA expression, 
Spot14-GFP-expressing cells (hereafter called Spotl4* cells) were 
largely restricted to the SGZ with only scattered expression outside 
the dentate gyrus, unlike previously described NSPC markers such as 
nestin, which also labels a substantial number of cells outside the adult 
neurogenic niches (Fig, 2a and Supplementary Fig. 6d, f). Spot14* cells 
in the dentate gyrus showed a radial and non-radial cell morphology“, 
and co-labelled with NSPC markers such as Sox2 (92.7 + 15%), glial 
fibrillary acidic protein (Gfap; 85.3 + 4.2% in radial cells) and nestin 
(73.4 + 5.7% in radial cells) (Fig. 2b and Supplementary Fig. 6g, h)*. 
Of all Spotl4* cells, only 9.36 + 1.1% were dividing. Non-radial 
Spot14* cells showed higher proliferation rates (7.86 + 1.0%) com- 
pared to radial Spotl4* cells (1.7 + 0.5%), as measured by Ki67 co- 
labelling (Fig. 2b and Supplementary Fig. 6i), indicating that the 
majority of Spot14* cells are not dividing under resting conditions. 


1Brain Research Institute, Faculty of Medicine, University of Zurich, 8057 Zurich, Switzerland. “Institute of Molecular Health Sciences, Department of Biology, Swiss Federal Institute of Technology (ETH) 
Zurich, 8093 Zurich, Switzerland. “Institute of Molecular Systems Biology, Department of Biology, ETH Zurich, 8093 Zurich, Switzerland. *Department of Cell and Developmental Biology, Max Planck 
Institute for Molecular Biomedicine, 48149 Muenster, Germany. ®Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland. "Washington University School of 


Medicine, Division of Endocrinology, Metabolism & Lipid Research, St. Louis, Missouri 63110, USA. 


*These authors contributed equally to this work. 


226 | NATURE | VOL 493 | 10 JANUARY 2013 


©2013 Macmillan Publishers Limited. All rights reserved 


b 


Z score 


Orlistat 
BrdU* compared to control (%) 


10 


Fasn-cKO 


| 5 | 
“: 


Figure 1 | NSPCs are in a distinct metabolic state and require Fasn- 
dependent lipogenesis for proliferation. a, In situ hybridization shows 
expression of Fasn within the neurogenic dentate niche. Fasn protein is 
expressed in nestin-GFP * cells. Insets show high-power views of Fasn- 
expressing cells with 4’,6-diamidino-2-phenylindole (DAPI; grey). GCL, 
granule-cell layer. b, Metabolomics show a distinct metabolic state of 
proliferating dentate gyrus and SVZ NSPCs (mDG prol and mSVZ prol) 
compared to proliferating Schwann cells (Schwann prol), non-proliferating 
Schwann cells (Schwann nprol), and differentiated progeny of hippocampal 
and SVZ NSPCs (mDG diff and mSVZ diff). Note the close clustering of 
distinct cell populations. c, Pharmacological inhibition of Fasn using orlistat 
leads to a dose-dependent reduction of hippocampal NSPC proliferation. 
BrdU, 5-bromodeoxyuridine. d, Genetic deletion of Fasn decreases 
proliferation of adult hippocampal NSPCs. Con, control; Cre, transduction of 


No. of YFP* cells per DG 


5 


Confirming the relative quiescence of Spot14* cells, we found that out 
of all Ki67~ cells within the dentate gyrus, only 5.1 + 0.7% were posi- 
tive for Spot14-GFP, whereas in the nestin-GFP mouse, 46.6 + 3.7% of 
all Ki67” cells were GFP positive (Supplementary Fig. 6)). 

To show directly that cells expressing Spotl4 possess neurogenic 
stem-cell potential in the adult brain we generated a BAC transgenic 
mouse expressing tamoxifen-inducible Cre under the regulatory 
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Cre-expressing virus to delete Fasn. Left panel, cells that were transduced 
with a red fluorescent protein (RFP)-expressing control virus and 
Cre-GFP-expressing retrovirus, and pulsed with BrdU. e, Conditional Fasn 
deletion in NSPCs reduces hippocampal neurogenesis. Shown are YFP- 
expressing cells in controls and mutant mice (Fasn-cKO). f, Quantification of 
YFP-labelled cells in the dentate gyrus (DG) 5 days and 40 days after tamoxifen 
(TAM)-induced recombination. g, Phenotyping of recombined, YFP-labelled 
cells reveals that conditional Fasn deletion reduces the number of radial 
glia-like NSPCs expressing Sox2 and Gfap 40 days after tamoxifen and leads to 
a substantial decrease in differentiated YFP-labelled cells. h, Quantification of 
phenotypes after conditional Fasn deletion. Error bars represent mean = s.e.m. 
Scale bars represent 100 um (a top panel, e), 40 um (a bottom panel), 50 pm 
(c, d) and 20 jm (g). *P < 0.05, #P = 0.055. 


elements of the Spotl4 gene (Spotl4-CreER'’). Spotl4-CreER™ 
mice were crossed with a ROSA reporter line expressing YFP upon 
Cre-based recombination, enabling in vivo lineage tracing of Spot14- 
expressing cells (Supplementary Fig. 7a). Soon after tamoxifen injec- 
tions (after 10 days) we found YFP labelling of radial (Sox2* Gfap™) 
and non-radial (Sox2* Gfap~) NSPCs in the dentate gyrus as well as 
very few astrocytes (Fig. 2c, d). At later time points after tamoxifen 
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Figure 2 | Spot14 is expressed in adult hippocampal NSPCs. a, In situ 
hybridization shows restricted expression of Spot14 in the SGZ of the dentate 
gyrus (top panel). Enhanced GFP expression under the regulatory elements 
of the Spot14 gene labels cells in the SGZ (bottom panel). b, Spot14-GFP 
co-labels with NSPC markers Sox2 and nestin. Arrows, GFP and Sox2 
co-labelled cells; arrowheads, GFP and nestin co-labelled radial glia-like 
processes (bottom-left panel). A fraction of Spot14-GFP cells divide. Arrow, a 
Ki67-positive, Spot14-GFP co-labelled cell (bottom-right panel). ¢, YFP 
expression in radial and non-radial NSPCs labelled by Sox2 and Gfap 

10 days after tamoxifen (TAM) injections in Spotl4-CreER™ mice. One 
month after tamoxifen injections, YFP-expressing cells co-label with Dcx, and 
3 months after tamoxifen injections these cells co-label with the mature 
neuronal marker NeuN (bottom-right panel). Note also that radial NSPCs 
were labelled with YFP 3 months after tamoxifen injections (bottom-left 
panel). DAPI is shown in grey. d, Quantification of cellular phenotypes 
generated 10 days, 1 month and 3 months after tamoxifen injections in 
Spot14-CreER™ mice. Astro, astrocytes. Scale bars represent 100 um (a) 

and 20 um (b, c). 


injections (1 month and 3 months later) we found YFP labelling of new 
granule cells expressing doublecortin (Dcx) and the pan-neuronal 
marker NeuN (neuronal nuclei; also known as RNA binding protein 
Foxl homologue 3) besides labelling of radial and non-radial NSPCs, 
both in the dentate gyrus and SVZ (Fig. 2c, d and Supplementary 
Fig. 7b-d). These findings indicate that Spot14-expressing cells cons- 
titute neurogenic NSPCs within the adult brain. 

To further characterize Spot14~ cells on functional and molecular 
levels, we prospectively isolated NSPCs from Spot14-GFP transgenic 
mice and used FACS to separate Spotl4* and Spotl4 NSPCs (Sup- 
plementary Fig. 8a, b). Both Spot14* and Spot14~ NSPCs expressed 
Sox2 and nestin (data not shown), self-renewed over an extended 
period of time in vitro and gave rise to neuronal and glial cells, indi- 
cating multipotency (Supplementary Fig. 8c—e). However, we observed 
reduced proliferation of Spot14* cells compared to Spot14~ cells using 
monolayer cultures and clonal time-lapse imaging (Fig. 3a, b and 
Supplementary Videos 1-4)", suggesting that the low proliferative 
activity of Spot14” cells within the dentate niche is maintained during 
culture conditions, a finding that was also supported by gene expression 
profiling (Supplementary Fig. 9a, b and Supplementary Tables 1-6). 

Given the low proliferation rate of Spot14* cells in vivo and in vitro 
we tested whether manipulation of Spot14 expression levels impacts 
NSPC proliferation. Retrovirus-mediated overexpression of Spot14 
reduced proliferation of cultured NSPCs (Fig. 3c and Supplementary 
Fig. 10a). To test for a function of Spotl4 in NSPC activity in vivo, we 
injected lentiviruses expressing short hairpin RNAs (shRNAs) and the 
fluorescent label mCherry into Spotl4-GFP reporter mice. Knock- 
down of Spotl4 mediated by shRNAs led to a shift in Spot14™ cells 
towards non-radial, more-proliferative NSPCs, compared to control 
cells expressing non-targeting shRNAs (Fig. 3d and Supplementary 
Fig. 10b). 
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Figure 3 | Spot14 regulates proliferative activity of adult NSPCs. 

a, Hippocampal Spot14* cells are less proliferative than Spotl4~ NSPCs. 

b, Time-lapse imaging of single hippocampal NSPCs cultured in hydrogel- 
based microwells reveals reduced proliferation of Spot14™ cells compared to 
Spotl4 NSPCs on a single-cell level. Shown are snapshots every 9h for 
Spot14 cells (top panels) and Spot14~ (bottom panels), and the quantification 
of the number of cell divisions over 81 h. c, Retroviral overexpression of Spot14 
(Spot14OE) reduces proliferation of hippocampal NSPCs. d, shaRNA-mediated 
knockdown of Spot14 (Spot14 shRNA) leads to a shift from radial towards non- 
radial Spotl4* NSPCs compared to a non-targeting, shRNA-expressing 
lentivirus (Con shRNA). Arrows, non-radial NSPCs; arrowheads, radial 
NSPCs. Error bars represent mean = s.e.m. Scale bars represent 40 um (a) and 
100 um (b, d). *P< 0.05. 


We next sought to identify a mechanism by which Spot14 controls 
NSPC proliferation. Spotl4 has been shown to interact with Mig12 
(Mid1-interacting protein 1), an activator of acetyl-coenzyme A 
(CoA)-carboxylase (Acc), the enzyme that catalyses the production 
of malonyl-CoA, which is a substrate for Fasn to generate new pal- 
mitate'®; however, heterodimerization with Spot14 impairs the ability 
of Migl2 to activate Acc, leading to decreased Acc activity’. We 
found expression of Fasn in Spot14™ cells and high levels of Mig12 
in adult NSPCs (Fig. 4a, b and Supplementary Fig. 11a—d). In addition, 
Spotl4 and Mig12 were both localized to the cytoplasm and co- 
immunoprecipitated (Supplementary Fig. lle, f), suggesting that an 
interaction between Spotl4 and Mig12 takes place in adult NSPCs. 
Retroviral overexpression of Migl2 rescued the Spot14-induced 
decrease in NSPC proliferation, indicating that a Spot14—Mig12 inter- 
action is involved in the regulation of NSPC proliferation (Sup- 
plementary Fig. 11g). Furthermore, virus-mediated knockdown of 
Mig12 reduced proliferation of NSPCs within the adult hippocampus 
in vivo and in vitro, as measured by EdU (5-ethynyl-2'-deoxyuridine) 
pulse labelling (Fig. 4c and Supplementary Fig. 11h, i). Using targeted 
liquid chromatography tandem mass spectrometry we found that 
Spot14 overexpression led to a significant drop in malonyl-CoA levels 
without affecting expression levels of Fasn protein (Fig. 4d and 
Supplementary Fig. 10c), suggesting that Spot14 decreases malonyl- 
CoA levels and as a consequence lowers substrate availability for Fasn 
to fuel de novo lipogenesis. To directly assess levels of de novo lipo- 
genesis upon Spot14 overexpression we measured total de novo lipid 
production by quantifying incorporation of radioactively labelled “*C- 
acetate and “C-labelled glucose into lipids. We found that Spotl4 
overexpression significantly decreased levels of lipid synthesis 
(Fig. 4e). These data support previous reports that Spot14 can act as 
a context-dependent modulator of de novo lipogenesis, either as an 
inhibitor* (as shown here for adult NSPCs) or activator of lipid syn- 
thesis**'”. In support of an essential role for Fasn and Spotl4, we 
found that genetic deletion of Fasn or overexpression of Spot14 (both 
leading to reduced de novo lipogenesis) induced a shift in the metabolic 
profile of NSPCs, as determined by principal component analyses of 
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Figure 4 | Spot14 modulates malonyl-CoA levels and de novo lipogenesis. 
a, Fasn protein is expressed in Spot14” cells. Insets show high-power views of a 
Spot14-GFP and Fasn co-labelled cell. b, Mig12 is expressed in adult NSPCs. 
c, shRNA-mediated knockdown of Mig12 in vivo (Mig12 shRNA, right panel) 
reduces proliferation of hippocampal NSPCs. d, Retroviral overexpression of 
Spot14 reduces malonyl-CoA levels in NSPCs. dpm, disintegrations per 
minute; pmol, picomol. e, Radioactive tracing using M4C_acetate and MC- 
glucose shows a decrease in de novo lipogenesis after Spot14 overexpression. 


metabolomics experiments (Supplementary Fig. 12). We next analysed 
the fate of newly synthesized lipids using thin-layer chromatography 
(TLC) and found that the majority of them (91.9%; Fig. 4g) ended up in 
the membrane lipid-containing fractions, suggesting that NSPCs 
require a large part of new lipids for structural membranes. 

We identify here a novel mechanism governing adult neurogenesis 
by showing that adult NSPCs require Fasn-dependent lipogenesis for 
proliferation and that Spotl4, a gene highly enriched in low proli- 
ferating NSPCs, downregulates NSPC proliferation by reducing lipid 
synthesis (Fig. 4g). Thus, Spot14 seems to act as a molecular brake on 
Fasn-dependent lipogenesis, potentially also integrating other signals, 
such as thyroid hormone-regulated pathways, to ensure relative 
quiescence of adult NSPCs. However, future studies will have to 
further dissect and characterize the exact cellular mechanisms and 
NSPC-subtype during which high levels of lipogenesis are required 
for proper proliferation. 

The targeting of metabolic pathways may hold potential for novel 
therapeutic approaches to treat diseases associated with failing adult 
neurogenesis, such as major depression or age-related cognitive decline. 


METHODS SUMMARY 


Mouse strains used were Spot14-GFP (Mutant Mouse Regional Resource Centres; 
MMRRC), nestin-GFP"’, Fasn floxed!?, nestin-CreER’?-R26YFP" and Spot14- 
CreER™*. The Spotl4-CreER' BAC transgenic line was generated according 
to a published protocol’’. All animal experiments were approved by the veterinary 
office of the Canton of Zurich, Switzerland. Cre-mediated recombination was 
induced by five consecutive intraperitoneal injections of tamoxifen (180 mg kg ') 
at the age of 6 to 8 weeks. Retro- and lentiviral constructs, their production and 
stereotactic injections were carried out as described previously”. Brain tissue was 
sectioned and stained using methods published previously*’. Murine NSPCs 
were cultured in DMEM with Ham’s F12, supplemented with N2 supplement 
plus EGF, FGF and heparin. Fasn enzyme activity was determined using an 
assay described previously”. Spotl4* and Spotl4” cells were live sorted and 
collected for RNA extraction, and gene expression analysis was then carried out 
using Affymetrix GeneChip arrays. To measure de novo lipid production in NSPCs 
overexpressing Spot14 and controls, cells were incubated with [1-'*C]acetate or 
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f, Lipid separation by thin-layer chromatography after '4C-acetate labelling 
shows a reduction in de novo synthesized free fatty acids, TAGs 
(triacylglycerols) and polar lipids in Spot14-overexpressing hippocampal 
NSPCs. g, Scheme showing the proposed mechanism: high-proliferating 
NSPCs require Fasn-dependent lipogenesis. Spot14 downregulates NSPC 
proliferation by reducing the Fasn-dependent generation of new complex fatty 
acids (FAs). Error bars represent mean = s.e.m. Scale bars represent 40 Lm 
(a), 20 um (b) and 50 tm (c). *P< 0.05. 
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D-[U-"*C]glucose and the organic phases of the lipid extracts were transferred 
to scintillation vials, and “C and 7H counts were measured. Metabolomics 
samples were analysed by ion pairing-reverse phase liquid chromatography 
tandem mass spectrometry on a Waters Acquity UHPLC coupled to a Thermo 
TSQ Quantum Ultra triple quadrupole instrument. Statistical analyses were 
carried out using two-tailed unpaired t-tests, non-parametric Mann-Whitney 
U-tests, the Kolmogorov-Smirnov test, or analysis of variance (ANOVA) 
followed by Fisher’s post hoc test if appropriate. Significance levels were set at 
P<0.05 (P< 0.01 for array analyses). A detailed methods section is available in 
the Supplementary Information. 
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Analysis of 6,515 exomes reveals the recent origin of 
most human protein-coding variants 
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Establishing the age of each mutation segregating in contemporary 
human populations is important to fully understand our evolu- 
tionary history’” and will help to facilitate the development of new 
approaches for disease-gene discovery’. Large-scale surveys of hu- 
man genetic variation have reported signatures of recent explosive 
population growth* *, notable for an excess of rare genetic variants, 
suggesting that many mutations arose recently. To more quanti- 
tatively assess the distribution of mutation ages, we resequenced 
15,336 genes in 6,515 individuals of European American and 
African American ancestry and inferred the age of 1,146,401 auto- 
somal single nucleotide variants (SNVs). We estimate that approxi- 
mately 73% of all protein-coding SNVs and approximately 86% 
of SNVs predicted to be deleterious arose in the past 5,000- 
10,000 years. The average age of deleterious SNVs varied signifi- 
cantly across molecular pathways, and disease genes contained a 
significantly higher proportion of recently arisen deleterious SNVs 
than other genes. Furthermore, European Americans had an excess 
of deleterious variants in essential and Mendelian disease genes 
compared to African Americans, consistent with weaker purifying 
selection due to the Out-of-Africa dispersal. Our results better 
delimit the historical details of human protein-coding variation, 
show the profound effect of recent human history on the burden of 
deleterious SNVs segregating in contemporary populations, and 
provide important practical information that can be used to prio- 
ritize variants in disease-gene discovery. 

As part of the US National Institutes of Health (NIH) Heart, Lung 
and Blood Institute (NHLBI)-sponsored Exome Sequencing Project 
(ESP), we sequenced the exomes of 6,515 individuals (Supplementary 
Table 1), including 4,298 European Americans and 2,217 African 
Americans. Exome data were subjected to standard quality control 
filters as previously described® (Supplementary Information), resulting 
in a data set of 1,146,401 autosomal protein-coding SNVs with a 
known ancestral state (709,816 and 643,128 in European Americans 
and African Americans, respectively) distributed across 15,336 pro- 
tein-coding genes. To quantitatively estimate the age of each SNV (that 
is, the allele age), we developed a simulation approach to generate a 
series of coalescent trees for a specified demographic model, and esti- 
mated allele age based on a derivation described previously’ (Sup- 
plementary Information). We verified the accuracy and robustness 
of this approach to factors including recombination rate heterogeneity, 
population growth, migration and purifying selection. Extensive coa- 
lescent simulations showed that we could accurately estimate the 
expected allele age from the simulated data, although the variance 
associated with any individual SNV can be large (Supplementary 
Figs 6 and 7). 

We estimated the age of all 1,146,401 SNVs using 6 different demo- 
graphic models***’, 3 of which considered recent explosive population 


growth*°** (Supplementary Table 2). Estimates of allele age were gene- 
rally robust across different demographic models, with the largest dis- 
crepancies resulting in a twofold difference in average age across all 
SNVs (Supplementary Table 3 and Supplementary Fig. 8a). However, 
because most SNVs arose recently (see below), differences among demo- 
graphic models were highly concordant (Supplementary Information). 
Accordingly, we report results based on a modified Out-of-Africa 
model’ in which accelerated population growth began 5,115 years ago 
with a per-generation growth rate of 1.95% and 1.66% for European 
Americans and African Americans, respectively’. 

The site frequency spectrum (SFS) of protein-coding SNVs re- 
vealed an enormous excess of rare variants (Fig. la). Indeed, we 
observed an SNV approximately once every 52 base pairs (bp) and 
57 bp in European Americans and African Americans, respectively, 
whereas in a population without recent explosive growth we would 
expect the SNVs to occur once every 257 bp and 152 bp in European 
Americans and African Americans, respectively (Supplementary 
Information). Thus, the European American and African American 
samples contain approximately fivefold and threefold increases in 
SNVs, respectively, attributable to explosive population growth, resul- 
ting in a large burden of rare SNVs predicted to have arisen very 
recently (Fig. 1b). For example, the expected age of derived singletons, 
which comprise 55.1% of all SNVs, is 1,244 and 2,107 years for the 
European American and African American samples, respectively. 
Overall, 73.2% of SNVs (81.4% and 58.7% in European Americans 
and African Americans, respectively) are predicted to have arisen in 
the past 5,000 years. SNVs that arose more than 50,000 years ago were 
observed more frequently in the African American samples (Fig. 1b), 
which probably reflects stronger genetic drift in European Americans 
associated with the Out-of-Africa dispersal. 

The average age across all SNVs was 34,200 + 900 years (+ s.d.) in 
European Americans and 47,600 + 1,500 years in African Americans, 
and these estimates were robust to sequencing errors (Supplementary 
Information and Supplementary Fig. 9). As expected, SNVs shared 
between European Americans and African Americans were significantly 
older (104,400 years and 115,800 years for European Americans and 
African Americans, respectively) than population-specific variants 
(5,400 years and 15,300years in European Americans and African 
Americans, respectively; Fig. 1c) (t-test; P< 0.00001 by permutation). 
Furthermore, there were large and significant differences among the 
average allele ages of SNVs stratified by functional type (t-test; 
P<0,00001 by permutation). For example, splice site, nonsense and 
non-synonymous SNVs were two to eight times younger than synonym- 
ous and non-coding variants (Fig. 1d). Moreover, we classified amino 
acids into four groups (non-polar and neutral, polar and neutral, acidic 
and polar, and basic and polar), and non-synonymous SNVs that 
resulted in changes between groups were significantly younger than 
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Figure 1 | The vast majority of protein-coding single-nucleotide variants 
arose recently. a, The site-frequency spectrum for European Americans 
(EAs, red) and African Americans (AAs, blue). DAF, derived allele frequency. 
b, Cumulative proportion of SNVs for a given allele age. The inset highlights the 
cumulative proportion of SNVs that are estimated to have arisen in the last 


those that resulted in changes within groups (t-test; P< 0.00001 by 
permutation; Supplementary Fig. 10a). These differences in average 
allele age are probably due to varying intensities of selective constraint 
among different classes of SNVs"’. Consistent with this prediction, we 
observed significantly higher values of the neutrality index, a measure 
of the direction and degree of departure from neutral evolution, in 
genomic regions enriched for younger variants (Spearman’s correla- 
tion; P= 0.004 and P= 0.001 for European Americans and African 
Americans, respectively; Supplementary Fig. 11), indicating a higher 
burden of deleterious SNVs. 

To more directly identify putatively deleterious SNVs, we used four 
functional prediction methods (SIFT, PolyPhen2 (ref. 14), a likelihood 
ratio test’®, MutationTaster'®) applicable to non-synonymous SNVs 
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Figure 2 | Characteristics of allele age for deleterious single-nucleotide 
variants. a, b, Average age of non-synonymous and other SNVs as a function of 
the number of methods that predict the variant to be deleterious. Pie charts 
represent the proportion of SNVs that arose less than (black) or more than 
(white) 5,000 years ago. Error bars represent s.d. c, Relationship between the 
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50,000 years. The x axis denotes allele age (1,000) and the y axis indicates the 
the cumulative proportion of SNVs (%). ¢, Average age for all SNVs, SNVs 
found in both the European Americans and African Americans (shared), and 
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and two conservation-based methods (GERP+ +" and PhyloP"*) ap- 
plicable to all SNVs (Supplementary Information). We found a strong 
inverse relationship between average SNV age and the number of me- 
thods that predicted a variant to be deleterious (Fig. 2a, b). Thus, SNVs 
predicted to be deleterious by multiple methods probably experience 
(on average) more intense purifying selection and may be of particular 
interest in disease mapping studies. The age of non-synonymous SNVs 
predicted to be deleterious by all 6 methods was 3,000 and 6,200 years in 
European Americans and African Americans, respectively, and 88.7% 
were less than 5,000 years old (92.9% and 80.6% in European Americans 
and African Americans, respectively). 

The strengths and weaknesses of functional prediction methods 
vary substantially and as a result the accuracy of any single method 
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is modest’. Therefore, we used a majority rule approach to identify a 
more conservative set of SNVs predicted to be deleterious®. Specifi- 
cally, non-synonymous SNVs predicted to be functionally significant 
by at least four methods and all other SNVs (synonymous, splice and 
non-coding variants) predicted by two conservation-based methods 
were designated as deleterious. In total, 14.4% (164,688) of SNVs, in- 
cluding 152,633 nonsynonymous variants, met these criteria. We 
found that allele age was strongly related to the probability that a 
variant was predicted to be deleterious (Supplementary Fig. 12), with 
the fraction of SNVs predicted to be deleterious diminishing as allele 
age increased (Fig. 2c and Supplementary Fig. 13). The average age of 
conservatively defined deleterious variants was 5,200 + 300 years for 
European Americans and 10,100 + 600 years for African Americans. 
Moreover, 86.4% of these SNVs were predicted to have arisen in the 
past 5,000 years (91.2% and 77.0% for European Americans and 
African Americans, respectively), corresponding to the onset of accele- 
rated population growth (Fig. 3a). In other demographic models, a 
similarly high proportion of deleterious SNVs were predicted to have 
arisen since the onset of accelerated growth rates, with the exact timing 
varying somewhat among models, but always in the timeframe of 5,000 
to 10,000 years (Supplementary Table 3 and Supplementary Fig. 8b, c). 

Moreover, 7,197 (57.4%) of the 12,533 genes in European Americans 
and 4,534 (37.5%) of the 11,607 genes in African Americans that har- 
bour one or more deleterious variants only possess deleterious SNVs 
with an estimated age of less than 5,000 years (Fig. 3b). Thus, recent 
accelerated population growth has had a large influence on the number 
of genes harbouring deleterious variants in contemporary populations. 
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Figure 3 | Distribution of deleterious single-nucleotide variants across the 
exome before and after recent accelerated population growth. a, Heat map 
representation of deleterious SNV density (number of deleterious protein- 
coding variants per Mb) for European Americans (EAs, red) and African 
Americans (AAs, blue). The distributions of deleterious SNVs across the exome 
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Notably, after correcting for exon length of each gene, 3 and 18 genes in 
European Americans and in African Americans, respectively, have a 
significant excess of deleterious variants that arose after the onset of 
recent accelerated growth (P=3 X 10°; Supplementary Table 4), 
including 12 genes that have been associated with human diseases’® such 
as LAMCI (premature ovarian failure*’), LRP1 (Alzheimer’s disease”’), 
CPE (coronary artery atherosclerosis”) and KIAA0196 (hereditary spas- 
tic paraplegia”). 

Next, we investigated the distribution of ages for conservatively 
defined deleterious SNVs in 849 genes that cause Mendelian disorders”, 
2,663 genes associated with complex diseases’’, 1,226 genes considered 
‘essential’ (that is, a mouse knockout associated with lethality or steri- 
lity)? and 11,711 genes classified as ‘other’ (Supplementary Informa- 
tion). The proportions of deleterious SNVs in genes for Mendelian 
disorders (15.9%), essential genes (15.2%) and genes associated with 
complex diseases (15.1%) were each significantly higher (Fisher’s exact 
test, P<10 °°) than the proportion in other genes (14.0%). In the 
European American samples, the proportion of deleterious SNVs did 
not decline monotonically as a function of age for Mendelian and essen- 
tial genes. Instead, the proportion of deleterious variants with an esti- 
mated age of 50,000 to 100,000 years in Mendelian disease genes and 
100,000 to 150,000 years in essential genes were elevated (Fig. 4a). This 
pattern was not observed in the African American samples (Fig. 4a). To 
explore this observation, we carried out simulations to estimate the 
probability that a deleterious SNV survives to the present day as a func- 
tion of when the variant arose, the magnitude of selection and the 
presence or absence of an Out-of-Africa bottleneck (Supplementary 
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before and after recent accelerated population growth are shown in the left and 
right panels, respectively. Numbers on the y axis represent chromosome 
number. b, The bar plots summarize the number of genes segregating one or 
more deleterious SNVs that arose before (left) or after (right) recent accelerated 
population growth. 
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Figure 4 | Heterogeneity of allele age across genes and pathways. 

a, Distribution of the proportion of deleterious SNVs for Mendelian, complex, 
essential and other genes in European Americans (EAs, top) and African 
Americans (AAs, bottom) versus age. Data for each of the four categories of 
genes is shown in each plot, with darker lines representing the specific gene class 


Information). Simulations of deleterious alleles in the presence of a bot- 
tleneck recapitulated the patterns observed in European Americans 
(Supplementary Fig. 14). Specifically, in the presence of a bottleneck, 
weakly deleterious alleles (selection coefficient, s=0.001) have an 
increased probability of survival precisely in the intervals 50,000 to 
100,000 years and 100,000 to 150,000 years. Thus, our simulations sug- 
gest that genes underlying disease and essential genes are more func- 
tionally constrained relative to other genes, and the bottleneck 
associated with the Out-of-Africa dispersal led to less efficient purging 
of weakly deleterious alleles”. 

Finally, we found that the average age of deleterious variants (and the 
proportion of deleterious variants; Supplementary Fig. 15) was signifi- 
cantly different across 235 KEGG (Kyoto Encyclopedia of Genes and 
Genomes) pathways (Kruskal-Wallis rank sum test; P = 0.0025 and 
P<0.00001 for European Americans and African Americans, respec- 
tively; Fig. 4b and Supplementary Information). The average age across 
pathways did not vary significantly when all SNVs were considered 
(Kruskal-Wallis rank sum test; P = 0.259 and P = 0.075 for European 
Americans and African Americans, respectively), indicating that the 
differences observed for deleterious variants probably represent hete- 
rogeneity of functional constraint across pathways. In general, the 
average age of deleterious variants in metabolic pathways was older 
than that in other pathways (Mann-Whitney test, P= 0.0001 for 
European Americans and African Americans, respectively), suggesting 
they are subject to less functional constraint. Conversely, deleterious 
variants in human disease pathways (Mann-Whitney test, P = 0.03 for 
African Americans) and in pathways involved in organismal sys- 
tems were significantly younger than other pathways (Mann-Whitney 
test, P= 0.04 and P=0.002 for European Americans and African 
Americans, respectively). 

In summary, the spectrum of protein-coding variation is consider- 
ably different today compared to what existed as recently as 200 to 400 
generations ago. Of the putatively deleterious protein-coding SNVs, 
86.4% arose in the last 5,000 to 10,000 years, and they are enriched for 
mutations of large effect (Supplementary Fig. 14) as selection has not 
had sufficient time to purge them from the population. Thus, it seems 


indicated by the column label. Shaded regions define 95% confidence intervals 
obtained by bootstrapping. b, Average ages for deleterious (projecting up) and all 
(projecting down) SNVs across 235 KEGG pathways that can be organized into 
six broad classes. Each of the six classes is comprised of multiple sub-classes, 
indicated by the different colour shadings (Supplementary Information). 


likely that rare variants have an important role in heritable phenotypic 
variation, disease susceptibility and adverse drug responses. In prin- 
ciple, our results provide a framework for developing new methods to 
prioritize potential disease-causing variants in gene-mapping studies. 
More generally, the recent dramatic increase in human population 
size, resulting in a deluge of rare functionally important variation, 
has important implications for understanding and predicting current 
and future patterns of human disease and evolution. For example, the 
increased mutational capacity of recent human populations has led to 
a larger burden of Mendelian disorders, increased the allelic and gene- 
tic heterogeneity of traits, and may have created a new repository of 
recently arisen advantageous alleles that adaptive evolution will act 
upon in subsequent generations”. 


METHODS SUMMARY 


Exome sequences were obtained for 6,823 individuals, who were sequenced to high 
coverage (median depth greater than 100X) on an Illumina GAII or HiSeq2000. 
Library construction, exome capture, sequencing, mapping, calling and filtering 
were carried out as described previously, with minor modifications® (see also 
Supplementary Information). After quality control and removal of related indivi- 
duals, 6,515 individuals were retained. Ancestry of each individual was inferred by 
principal component analysis (PCA) performed on the sequence data. We 
developed a simulation approach based on coalescent theory to estimate allele 
age, which was applied to 1,146,401 autosomal SNVs with known ancestral states. 
A complete description of the materials and methods is provided in the Sup- 
plementary Information. 
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Bright radio emission from an ultraluminous 
stellar-mass microquasar in M 31 


Matthew J. Middleton’?, James C. A. Miller-Jones*, Sera Markoff*, Rob Fender*, Martin Henze”, Natasha Hurley- Walker’, 
Anna M. M. Scaife’, Timothy P. Roberts', Dominic Walton®”, John Carpenter’, Jean-Pierre Macquart?®, Geoffrey C. Bower’, 
Mark Gurwell’®, Wolfgang Pietsch°, Frank Haberl°, Jonathan Harris, Michael Daniel’, Junayd Miah!, Chris Done’, John S. Morgan?, 
Hugh Dickinson", Phil Charles*!*, Vadim Burwitz®, Massimo Della Valle'**, Michael Freyberg”, Jochen Greiner?, 

Margarita Hernanz’, Dieter H. Hartmann’©, Despina Hatzidimitriou’’, Arno Riffeser'®, Gloria Sala’’, Stella Seitz!*, Pablo Reig”°, 


Arne Rau”, Marina Orio”, David Titterington”” & Keith Grainge” 


A subset of ultraluminous X-ray sources (those with luminosities 
of less than 10° ergs’; ref. 1) are thought to be powered by the ac- 
cretion of gas onto black holes with masses of ~5-20M >, probably 
by means of an accretion disk”*. The X-ray and radio emission are 
coupled in such Galactic sources; the radio emission originates in a 
relativistic jet thought to be launched from the innermost regions 
near the black hole**, with the most powerful emission occurring 
when the rate of infalling matter approaches a theoretical maxi- 
mum (the Eddington limit). Only four such maximal sources are 
known in the Milky Way’, and the absorption of soft X-rays in the 
interstellar medium hinders the determination of the causal 
sequence of events that leads to the ejection of the jet. Here we 
report radio and X-ray observations of a bright new X-ray source 
in the nearby galaxy M 31, whose peak luminosity exceeded 
10°’ ergs '. The radio luminosity is extremely high and shows 
variability on a timescale of tens of minutes, arguing that the source 
is highly compact and powered by accretion close to the Eddington 
limit onto a black hole of stellar mass. Continued radio and X-ray 
monitoring of such sources should reveal the causal relationship 
between the accretion flow and the powerful jet emission. 

XMM-Newton first detected XMMU J004243.6+412519 on 15 
January 2012 (ref.7) at an X-ray luminosity of 2X 10°° ergs (fora 
distance to M 31 of 0.78 Mpc (ref. 8)), with an X-ray spectrum that 
could be fully described by a hard power-law, characteristic of sub- 
Eddington accretion (mass accretion rates <70% of the Eddington 
limit?"’). The source then rose to >10*’ ergs ' in two subsequent 
detections, fulfilling the traditional definition of an ultraluminous 
X-ray source (ULX; although other definitions exist, the term ULX is 
numerical rather than physically motivated)"’ and significantly above 
the cutoffluminosity of the X-ray luminosity function of M 31 (ref. 12), 
which shows no sources more luminous than 2 X 10°° ergs’. 

At the peak luminosity of (1.26 + 0.01) x 10° ergs ', the X-ray 
spectrum seemed similar to that of Galactic black-hole X-ray binaries 
(BHXRBs) at mass accretion rates close to or above the Eddington 
limit’*. In such cases, the emission is dominated by an optically thick ac- 
cretion disk’ whose spectrum may appear broadened because the accre- 
tion process is no longer radiatively efficient'*. This can be accompanied 


by a second, weaker, thermal component at higher energies, possibly 
due to Compton up-scattering of disk photons in a wind or pho- 
tosphere. These two components are also required to fit high-quality 
spectra of nearby ‘low-luminosity’ ULXs*"*, implying that similar accre- 
tion processes are taking place. Although the intrinsic emission can 
potentially be amplified through geometrical beaming", this is thought 
to be important only for ULXs above 10*’ ergs '. 

After the X-ray outburst, the source was monitored by the Swift and 
Chandra missions for more than 8 weeks (see Fig. 1 and Supplemen- 
tary Table 1), until the source was no longer observable because of the 
Sun angle. The X-ray luminosity decreased slightly to ~10°’ ergs’ ', 
with the spectrum evolving to be fully described by emission from a 
standard accretion disk* with a peak temperature characteristic of 
high-mass-accretion-rate Galactic BHXRBs’. Both the timescale for 
the X-ray spectral variations and the temperature of the disk compo- 
nent rule out an interpretation as a background active galaxy. 

A subsequent period of X-ray monitoring showed the source to 
remain disk-dominated while decaying to a luminosity of ~7 X 10°” 
ergs '. The correlation of luminosity with disk temperature as L x T* 
is a well-established observational tracer of sub-Eddington accretion. 
We find that the best-fitting spectral parameters (see Supplementary 
Table 2) deviated strongly from this correlation when the source was 
brightest: the temperature decreased with luminosity (Supplementary 
Fig. 2). Such behaviour is well documented in ULXs”, strengthening 
our identification of this object as a member of this class, albeit a low- 
luminosity one. 

The X-ray detection of this nearby bright source motivated a series 
of radio observations by the Karl G. Jansky Very Large Array (VLA; see 
Supplementary Table 3), Very Long Baseline Array (VLBA) and 
Arcminute Microkelvin Imager Large Array (AMI-LA). The radio 
light curves are shown in Fig. 2 (observational details are provided 
in Supplementary Information). The source was initially detected by 
the VLA at a highly significant level (>40c), with a 4-50 AMI-LA 
detection the following day. The spectral index (defined by S, x v*) 
measured by the VLA was slightly inverted, at « = 0.27 + 0.16. 

The AMI-LA monitoring demonstrated that the radio emission was 
variable on timescales of days, as confirmed by a detection of strongly 
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Figure 1 | Evolution of the X-ray luminosity and spectrum over time. 

a, Evolution of the source brightness over ~200 days together with the time of the 
first VLA detection (blue vertical dashed line). The unabsorbed 0.3-10 keV 
luminosities (and 90% error bars) are derived from the best-fitting models to the 
observations with XMM-Newton (red), Chandra (blue) and Swift (green) (see 
Supplementary Tables 1 and 2). An upper limit from an XMM-Newton 
observation taken a week before the source brightened is also included. 

b-f, Spectral data (black points with 90% error bars) and best-fitting models (in 
red; see Supplementary Information and Supplementary Table 2) from which the 
unabsorbed luminosities are calculated. b, The first detection before the peak 
brightness shows a spectrum that can be fully described by a hard power-law. c, As 
the source brightens to a peak luminosity of 1.26 X 10° ergs, the spectrum 
evolves to one requiring emission from an optically thick accretion disk 
(dot-dashed blue line) and a second, weaker thermal component (dot-dashed red 
line) that may be related to a wind or photosphere. Such spectra are also seen in 
high-mass-accretion-rate BHXRBs”* and low-luminosity ULXs*"*. d, As the 
source dims to ~10*’ ergs ', the spectrum becomes fully dominated by the disk 
component. e, The spectrum remains disk-dominated but the luminosity 
decreases, similar to BHXRBs decaying after an outburst. f, The most recent 
XMM-Newton observation demonstrates that the source remains disk-dominated 
down to ~7 X 10°” ergs‘, which places an upper limit on the mass" of <17Mo. 


reduced emission by the VLA (Fig. 2) and a non-detection by the Com- 
bined Array for Research in Millimeter-wave Astronomy (CARMA). 
Previous, well-characterized, radio-luminous ULXs have been iden- 
tified with emission from an optically thin, diffuse nebula believed to 
be powered by the bright accretion flow’’. The radio variability from 
XMMU J004243.6+412519 demonstrates that the emission cannot be 
nebular in origin. Furthermore, the unresolved VLBA detection shown 
in Fig. 2 constrains the size of the emitting region to be <1,500 AU. 
This size limit, together with the variability on a timescale of days, 
confirms that the radio emission must originate in a relatively com- 
pact, relativistically outflowing region with a brightness temperature 
>6 X 10°K. 

BHXRBs accreting at sub-Eddington rates produce powerful, per- 
sistent radio jets; the radio and X-ray emission are strongly correlated 
and are linked to black-hole mass through the ‘fundamental plane of 
black hole activity’’’”°. However, the thermal, disk-dominated X-ray 
spectrum after the peak of the outburst implies accretion at a relatively 
high fraction of the Eddington limit'® (3-100%). The fundamental 
plane relation does not hold for sources accreting at such high mass 
accretion rates and so cannot be used to constrain the mass of the black 
hole. At these high accretion rates, the radio emission from BHXRBs is 
instead observed to undergo flaring periods, associated with transitions 
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from hard to soft X-ray spectral states’. Indeed, scaling the radio and 
X-ray flux from this source to be within our Galaxy would give fluxes 
similar to soft, bright, radio-flaring BHXRBs such as GRS 1915+105 
and Cygnus X-3 (ref. 21). The observed X-ray and radio emission of 
this source is therefore fully consistent with analogous behaviour, and 
the radio emission is probably associated with shock acceleration of 
particles within ejections moving away from the black hole” with a 
high Lorentz factor ("= 2; ref. 5). 

The AMI-LA light curve (Fig. 2) suggests multiple ejection events, 
although the observed inverted, optically thick, spectrum is at odds 
with the optically thin emission expected from the later stages of an 
expanding synchrotron-emitting plasma. Either the VLA observed the 
early, self-absorbed stage of a flare or there is additional, free-free ab- 
sorbing material in the environment of the source. 

A search for shorter-timescale fluctuations in the VLA data revealed 
significant variability in the first epoch, on a characteristic timescale of 
several minutes (Fig. 2). Regardless of whether we attribute this to 
intrinsic variability or to scintillation (see Supplementary Informa- 
tion), this timescale implies a source size of only ~5 AU (a few micro- 
arcseconds at the distance of M 31), constraining the emitting region 
to be highly compact. 

There are clear similarities between the behaviour of XMMU 
J004243.6+412519 and the few ‘super-Eddington’ Galactic BHXRBs. 
Of these, the canonical microquasar GRS 1915+105 is the only one 
whose luminosity regularly exceeds 10° ergs‘ and that would some- 
times appear as a ULX to an extragalactic observer’. Multiwavelength 
studies of the disk-jet coupling in GRS 1915+ 105 (ref. 24) have iden- 
tified both ‘plateau’ states with steady jet emission at lower mass accre- 
tion rates and ‘flaring’ states with rapid radio oscillations at accretion 
rates around the Eddington limit, accompanied by an extremely soft 
X-ray spectrum. The latter, minute-timescale”* radio flaring, has been 
observed to occur immediately after a major radio flare, accompanied 
by X-ray brightness changes that are dominated by variability above 
3 keV. If the short-timescale radio variability we observe is intrinsic (see 
Supplementary Information), it could be an analogue of this behaviour, 
although we would not expect to detect the X-ray variability in our 
observations because the flux above 3 keV is extremely low as a result of 
the greater source distance. Although GRS 1915+105 shows clearly 
analogous behaviour, when scaled to a similar distance our source is 
approximately an order of magnitude more luminous in the radio band, 
with a variability brightness temperature for the flares of ~7 X 10'°K 
(see Supplementary Information). Such a discrepancy can readily be 
explained by differences in the inclination angle of the jets to our 
line of sight; GRS1915+105 is highly inclined and thus Doppler- 
deboosted, whereas we may be looking closer to the jet axis in 
XMMU J004243.6+412519. This physical picture is supported by 
comparison with the Galactic BHXRBs, Cygnus X-3 and V4641 Sgr, 
both of which have reached similarly high radio luminosities and pro- 
bably have low angles to the line of sight***’. Our source seems par- 
ticularly well matched to V4641 Sgr, which reached ~0.5 Jy in its 1999 
outburst’’, and has also shown similarly high brightness temperatures. 

The coupled radio and X-ray behaviour in this source is fully con- 
sistent with a picture of a BHXRB rising above the Eddington limit in 
outburst then dimming to a bright sub-Eddington state, with radio 
flaring analogous to that seen in GRS 1915+105 and other BHXRBs’. 
We can therefore make a secure identification of the compact object in 
this source as a black hole of stellar mass (<70M 3). 

Because the source remains in a disk-dominated state down to 
~7 X 10°” erg s |, and the lower limit for such a state is ~3% of the 
Eddington limit’®, this constrains the mass of the black hole to <17M oo, 
However, because the joint radio and X-ray behaviour implies 
Eddington-rate accretion’ at the peak luminosity of ~1.3 x 10°’ ergs‘, 
we favour a more conservative mass estimate of ~10Mo. 

We can also constrain the nature of the companion star from archival 
optical data”’. The field does not contain a source within 3 arcsec (where 
the positional accuracy from the VLBA observations is extremely high; 
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Figure 2 | Summary of the radio observations. a, Multifrequency radio light 
curves of the ULX from our AMI-LA, VLA, VLBA and CARMA data. For the 
AMI-LA data we plot the flux density from the pixel value at the source 
location, because of dynamic range limitations from the bright nearby 
confusing source, 5C 3.111 (Supplementary Fig. 1). The resulting high noise level 
close to the ULX hindered the standard image-plane source fitting methods used 
for the VLA and VLBA data. To estimate the significance of the AMI points, we 
have plotted both the 3-sigma and 5-sigma noise levels (dot-dashed and dotted 
lines, respectively), showing both a standard and a (favoured) more conservative 
detection threshold. b, Radio light curves binned to an interval of 6 min (the 
phase referencing cycle time), showing the short-timescale variability during the 
first VLA observation. There is an additional 1% systematic uncertainty on the 


see Supplementary Information) down to V ~ 25 and B ~ 26, ruling 
out an identification of the companion as an OB star and implying that 
the system is accreting from a star of lower mass, similar to other 
transient, low-luminosity ULXs**°. This also provides further evidence 
that the emission is not associated with a background active galaxy, 
which would be substantially brighter. 

Because the X-ray properties of the source are typical of the class*'*”, 
we confidently extend the identification of Eddington-rate accretion 
onto a black hole of stellar mass in XMMU J004243.6+ 412519, to the 
larger population of low-luminosity ULXs (a substantial component 
of the overall population, with up to 80% having luminosities of 
(1-5) X 10°? erg s ')'. As a result, this implies that we have a large 
population of sources from which to develop models for Eddington 
accretion. In our own Galaxy we are limited to a very small number 
of such sources, and the large absorbing column through the plane of 
our Galaxy prevents a clear view of the Eddington-limited accretion 
flow. However, this is not the case for the vast majority of extragalactic 
sources, allowing the properties of the infalling matter to be accurately de- 
termined. Understanding how the disk and associated outflows behave in 
this regime will allow us to address outstanding cosmological problems 


70 80 90 100 


overall flux scale. We are unable to determine conclusively whether the observed 
variability is intrinsic to the source or whether it arises from scintillation 

in the interstellar medium. Regardless of the mechanism involved, this 
constrains the source size to $6 jas. c, Corresponding spectral indices derived 
from the two VLA observing bands centred at 5.256 and 7.45 GHz. d, VLBA 
image of the source. Colour scale is in Jy per beam, and the contours are set at 
levels of +(/2)” times the root mean squared noise of 21 py per beam, where 
n= 3,4,5. The source is unresolved down to a beam size of 2.0 X 1.2 mas” in 
position angle —26° east of north. The (J2000) source position, measured 
relative to J0038+4137 and before performing any self-calibration, is RA 

00 h 42 min 43.674360(9) s, dec. 41° 25’ 18.7037(1)”. Error bars in all panels 
show the statistical uncertainties at the 1a level. 


including how outflows from quasars redistributed matter and energy in 
the early Universe. 

Assuming that the properties of this source are representative of the 
larger population, we can consider the possibility of detecting similar 
events in future. The sensitivity of the VLA would allow us to detect a 
similarly relativistically beamed event out to 4Mpc, or 0.5 Mpc if 
unbeamed. The X-ray monitoring cadence for M 31 has yielded two 
candidates in 10 years; although the monitoring was not constant, this 
is roughly consistent with the observed rate of Galactic BHXRBs with 
low-mass companions whose outbursts reach the Eddington lumino- 
sity. It is therefore a realistic prospect (see Supplementary Informa- 
tion) that future observations of transient ULX systems in nearby 
galaxies, using sensitive radio telescopes, will permit detailed disk- 
jet coupling studies and in doing so will significantly expand our un- 
derstanding of Eddington accretion and associated phenomena. 
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Topological colloids 


Bohdan Senyuk'”, Qingkun Liu!’, Sailing He*®, Randall D. Kamien’, Robert B. Kusner®, Tom C. Lubensky* & Ivan I. Smalyukh'?° 


Smoke, fog, jelly, paints, milk and shaving cream are common 
everyday examples of colloids’, a type of soft matter consisting of 
tiny particles dispersed in chemically distinct host media. Being 
abundant in nature, colloids also find increasingly important appli- 
cations in science and technology, ranging from direct probing of 
kinetics in crystals and glasses’ to fabrication of third-generation 
quantum-dot solar cells*. Because naturally occurring colloids have 
a shape that is typically determined by minimization of interfacial 
tension (for example, during phase separation) or faceted crystal 
growth’, their surfaces tend to have minimum-area spherical 
or topologically equivalent shapes such as prisms and irregular 
grains (all continuously deformable—homeomorphic—to spheres). 
Although toroidal DNA condensates and vesicles with different 
numbers of handles can exist*’ and soft matter defects can be 
shaped as rings® and knots’, the role of particle topology in colloidal 
systems remains unexplored. Here we fabricate and study colloidal 
particles with different numbers of handles and genus g ranging 
from 1 to 5. When introduced into a nematic liquid crystal—a fluid 
made of rod-like molecules that spontaneously align along the so- 
called ‘director’’°—these particles induce three-dimensional direc- 
tor fields and topological defects dictated by colloidal topology. 
Whereas electric fields, photothermal melting and laser tweezing 
cause transformations between configurations of particle-induced 
structures, three-dimensional nonlinear optical imaging reveals 
that topological charge is conserved and that the total charge of 
particle-induced defects always obeys predictions of the Gauss- 
Bonnet and Poincaré-Hopf index theorems''’. This allows us to 
establish and experimentally test the procedure for assignment 
and summation of topological charges in three-dimensional direc- 
tor fields. Our findings lay the groundwork for new applications of 
colloids and liquid crystals that range from topological memory 
devices’, through new types of self-assembly'’*”’, to the experi- 
mental study of low-dimensional topology®”"*». 

Although a coffee mug and a doughnut look different to most of us, 
they are topologically equivalent solid tori or handlebodies of genus 
g = 1, both being different from, say, balls and solid cylinders of genus 
g = 0, to which they cannot be smoothly morphed without cutting'?”. 
In a similar way, molecules can form topologically distinct structures 
including rings, knots and other molecular configurations satisfying 
the constraints imposed by chemical bonds”. Although the topology 
of shapes, fields and defects is important in many phenomena and 
in theories ranging from the nature of elementary particles to early- 
Universe cosmology”*”®, topological aspects of colloidal systems (com- 
posed of particles larger than molecules and atoms but much smaller 
than the objects that we encounter in our everyday life) are rarely 
explored. Typically dealing with particles with surfaces homeo- 
morphic to spheres, recent studies*”'>"*”°-*? demonstrate that the 
topology of curved surfaces dictates the formation of defects during 
two-dimensional colloidal crystallization at fluid interfaces as well as 
inside liquid crystal droplets and around spherical inclusions in liquid 
crystals. However, despite the fact that several techniques for scalable 
fabrication of particles with complex geometric shapes and g > 0 have 


recently been introduced’*'**”’, the potential impact of particle 
topology on colloidal alignment, self-assembly and response to fields 
remains unexplored. 

To study the interplay of particle topology and defects in liquid 
crystals, we fabricated topologically distinct silica particles with planar 
symmetry and handlebody topology of genus g varying from 1 to 5; 
their surfaces had an Euler characteristic y = 2 — 2g ranging from 0 to 
—8 (Fig. 1 and Supplementary Fig. 1). These particles had 1 1m X 1 um 
rounded square cross-sections and ring diameters ranging from 5 to 
10 um. Handlebody particles were introduced into a nematic liquid 
crystal, pentyl cyanobiphenyl, and the ensuing dispersion was infil- 
trated into cells bounded by parallel uniformly separated substrates 
treated to impose either perpendicular (homeotropic) or parallel align- 
ment of the director n and thereby to create a uniform director no in 
their interior in the absence of inclusions. Before dispersion, the sur- 
faces of particles were also treated to induce perpendicular boundary 
conditions for n. The director field n(r) around these handlebody 
colloids, which approaches mp at large distances, was probed optic- 
ally by a combination of transmission-mode polarizing microscopy 
(PM) and three-photon excitation fluorescence polarizing microscopy 
(3PEF-PM)*, schematically shown in Supplementary Fig. 2. Holo- 
graphic optical tweezers allowed non-contact optical manipulation 
of particles at laser powers of 5-50 mW and local photothermal mel- 
ting of the liquid crystal into an isotropic state at powers of about 
100 mW and higher. Because of the strong surface anchoring and 
rounded cross-section of the colloids, quenching the liquid crystal 
from the isotropic to the nematic phase creates director configurations 
that vary smoothly away from homeotropic alignment at the particle 
surfaces and that also exhibit bulk defects. 

Colloidal handlebodies spontaneously align with their ring planes 
either perpendicular or parallel to mp (Figs 1 and 2 and Supplementary 
Figs 3-9). The prevailing alignment of handlebodies perpendicular to 
Mo is more common because it minimizes the elastic free energy of 
n(r) distortions induced by the particles with perpendicular boundary 
conditions. Handlebodies aligned with ring planes parallel to mp are 
obtained by melting and subsequently quenching the surrounding 
liquid crystal with laser tweezers (Fig. 2). These particles can also be 
made to align parallel to or obliquely to mp by confinement in cells 
of thickness comparable to their lateral dimensions (Supplementary 
Fig. 10). Handlebodies are elastically repelled from both confining sub- 
strates as a result of strong surface anchoring conditions. However, 
because of the density mismatch between silica and the liquid crystal, 
they tend to rest somewhat below the cell midplane, where gravity is 
balanced by the elastic forces (Supplementary Fig. 9). 

Optical micrographs obtained by using different imaging modalities 
(Fig. la—h) reveal that handlebody colloids aligned perpendicular to mo 
are all surrounded by single half-integer exterior disclination loops of 
topological point defect (hedgehog) charge m = —1 but have different 
defects within their interiors (Fig. li-l). Each genus-g particle has g 
defects in its holes, which are either singular disclination loops or 
hyperbolic point defects of topological hedgehog charge m= +1. 
Disclination loops in the holes of each handlebody can be transformed 
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Figure 1 | Colloidal handlebodies aligned orthogonal to the far-field 
director. a-d, g = 5 handlebodies and induced n(r) structures imaged by PM 
without (a) and with (b) a retardation plate, bright-field microscopy (c) and 
3PEF-PM (d) techniques. e-h, 3PEF-PM textures of single (e), double (f), triple 
(g) and quadruple (h) handlebodies. P, A and y mark the crossed polarizer, 
analyser and a slow axis of a retardation plate (aligned at 45° to P and A), 
respectively. In the images (d-h) obtained by overlaying 3PEF-PM fluorescence 


into point defects and vice versa by melting the liquid crystal into an 
isotropic state with tweezers of laser power more than 100 mW and 
subsequently quenching into a nematic phase, indicating that free 
energies due to director configurations with these defects are compar- 
able. These hedgehog charges of the point defects and disclination 
loops have been determined by assuming that the vector field lines 
point perpendicularly outwards from the particle surfaces (Fig. 3) and 
by mapping the vector fields around particles, point defects and dis- 
clination loops onto the order-parameter space*’**. Because n has non- 
polar symmetry (that is, m is equivalent to — n), one could have chosen 
the vector field pointing inwards to the surface of colloids, which 
would consequently reverse the signs of all hedgehog charges induced 
by particles in a uniformly aligned liquid crystal. The relative charges 
of all the defects would remain the same, as would the net charge of 0, 
ensuring conservation of topological charge. 

Colloidal particles oriented with their rings parallel to mp tend to 
induce point defects both within the holes and next to the particles 
(Fig. 2). The point defects occasionally open into disclination loops that 
follow the curved edge faces of particles and have a topological charge 
equivalent to that of the point defect that they replace (Fig. 2b, e, h, k). 
Although the handlebodies oriented perpendicular and parallel to no 
induce a different director field n(r), the sum of hedgehog charges 
due to induced point defects and disclination loops, $>;m;=—m, 
= + 7/2, compensates for the colloidal particles’ hedgehog charge m-. 
due to n(r) at their surfaces and is uniquely predetermined by particle 
topology (Fig. 3). The signs depend solely on the choice of the direction 
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intensity patterns for two orthogonal polarizations of excitation light, green and 
magenta colours correspond to the polarization directions marked by green 
and magenta arrows, respectively. Cross-sectional xz and yz images were 
obtained along yellow lines shown on the corresponding in-plane images. 

i-l, Diagrams of n(x) (black lines) around g handlebodies. Red and magenta 
lines show outer and inner disclination loops of m = —1andm = +1 hedgehog 
charges, respectively. Magenta spheres show m = +1 hyperbolic point defects. 


of the vector field at the surface of particles. This relation holds for all 
colloidal handlebodies (g = 1,2,...,5) and for spherical colloids with 
g=0and )°,m;= + 7/2 = +1 (Supplementary Fig. 12) studied previ- 
ously’*, and can be understood using simple considerations based on 
the Gauss—Bonnet theorem”. Recall that the topological charge m, of 
any region of space V bounded by a surface S = OV is the degree'** of 
n along S, which can be calculated by integrating the Jacobian of n(r) 


over that surface’*?*?, m,=(1/4n) | dx,dx,n-0,n x 0.n. Because 


n(r) aligns with the (outer) unit iinet field to the colloidal surface 
S, the integral reduces to the total Gauss curvature of S divided by 4n. 
The Gauss-Bonnet theorem” states that the total Gauss curvature of a 
closed surface without boundary is quantized in units of 4m equal to 
4n(1 — g) = 2my and remains unchanged during all continuous defor- 
mations of the surface; it follows that the hedgehog charge m, of n(r) 
along S is (up to sign) m,. = +2ny/(4m) = +(1—g). Because the direc- 
tor is roughly constant (along mo) far from the colloidal inclusions, an 
imaginary surface surrounding the colloids and all other defects will 
have a net zero charge. It follows that the sum of the defect charges must 
cancel the degree on the colloidal surface S, and so the total hedgehog 
charge of point defects and disclination loops will be $°,m; = —m,_= 
+4/2= +(1—g), regardless of the orientation of the particles with 
respect to mg, as observed experimentally. 

The diagram in Fig. 3g shows that both interior and exterior discli- 
nation loops of the configurations shown in Fig. 1 can be trans- 
formed to hyperbolic point defects of equivalent hedgehog charge. 
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Figure 2 | Colloidal g handlebodies aligned parallel to the far-field director. 
a-i, Polarizing (a-f) and bright-field (g-i) textures and corresponding 
diagrams of n(r) for different colloidal tori. Magenta and red spheres show the 
m= +1andm = —1 hyperbolic point defects, respectively. j-1, The red loop in 
k shows a curved half-integer disclination ring with hedgehog charge m = —1 
observed when a hyperbolic point defect near a solid torus (a, j) opens into a 
disclination loop (b, k). The black lines in j-1 depict n(r) in the plane of colloidal 
handlebodies (yellow) and in the plane orthogonal to the handlebodies (blue), 
both planes being parallel to mo. 


Furthermore, these structures can be also transformed into a non- 
singular twist-escaped looped n(r) configuration with a net topological 
hedgehog charge equal to zero (Fig. 3h) and resembling the “bubble 
gum’ structure studied previously~’”’. Although perpendicular bound- 
ary conditions due to the handlebody-shaped particles in the liquid 
crystal with a uniform no can be satisfied by a minimum number 
of point or ring defects of the same sign having the total hedge- 
hog charge of + 7/2 (that is, no singularities for a solid torus, as shown 
in Fig. 3h, and g — 1 point or ring defects for a handlebody of genus g), 
these field configurations relax to topology-satisfying n(r) that 
also minimize the free energy. The energetic cost of introducing 
colloids into liquid crystal is dominated by the elastic energy 


P= >| {Ku(V-n)+ Kila xn)? + Kx V xn) where Kj, 


K, and K; are splay, twist and bend elastic constants, respectively, 
although the total energy additionally includes the surface energy 
due to finite surface anchoring of n(r) at the particle surfaces, the con- 
tribution of flexoelectric terms, and the energy of defect cores that can 
be treated as having a reduced order parameter or a biaxial nature’’. 
The surfaces of handlebody colloids have regions with opposite curva- 
ture, thus inducing the corresponding distortions of n(r) that mini- 
mize elastic energy for perpendicular boundary conditions at their 
surface. This results in the appearance of additional self-compensating 
pairs of defects of opposite hedgehog charge, leading to a number of 
defects that exceeds the minimum number, g— 1, required by topo- 
logy. In the experimentally studied systems, colloidal g-handlebodies 
typically induce g + 1 individual singularities. Of these, g — 1 defects 
are of the same charge and are dictated by the particle topology, and 
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Figure 3 | Director field and point defects around colloidal handlebodies. 
a-d, Diagrams showing the vector-field representations of n(r) (green lines 
with arrows) around single (a, c) and double (b, d) colloidal handlebodies in the 
plane of the rings (c, d) and in planes orthogonal to them (a, b). e, f, Diagrams of 
n(r) around hyperbolic point defects of negative (e) and positive (f) topological 
signs shown by red and magenta filled circles. g, h, Diagrams showing elastic- 
energy-costly unstable n(r) structures with point defects of opposite m = +1 
(g) hedgehog charges and non-singular twist-escaped configuration (h) near a 
handlebody oriented perpendicular to mo. The insets in h show the detailed 
vector field of the escaped axially symmetric configuration corresponding to 
non-singular n(r) of m = 0. The cross-section of the field around a handlebody 
resembles that of an integer-strength disclination loop and is compensated for 
by an integer-strength disclination loop of opposite strength; the singularity of 
the latter is removed by an ‘escape in the third dimension’ by means of 
continuous deformations as shown in the insets to h. 


two additional defects with opposite signs (total hedgehog charge 
zero) appear to relax n(r) distortions to minimize the free energy. 
Because K, < K, < K; for pentyl cyanobiphenyl’’, some of the splay 
distortions in the holes of the handlebodies confined into thin cells 
transform into more complex configurations, as demonstrated by 
spiralling dark and bright brushes in PM and 3PEF-PM images 
(Fig. la—d, g and Supplementary Figs 3 and 8). Although the structures 
shown in Fig. 3g, h and Supplementary Fig. 11 are unstable because of 
their high free energy and are found to relax to other topologically 
equivalent stable and metastable configurations around individual 
colloidal handlebodies (Figs 1 and 2), there is a possibility that they 
could be stabilized by confinement in twisted liquid crystal cells, as 
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previously observed for the ‘bubble gum’ configurations formed 
around colloidal dimers”. 

We have characterized the Brownian motion of colloidal handle- 
bodies (Fig. 4a—c). Their diffusion in a planar cell with thickness much 
larger than the diameter of the handlebody is highly anisotropic 
(Fig. 4a, d) and easier along ny than perpendicular to it. The slopes 
of mean square particle displacements (MSDs), shown in Fig. 4a for 
a solid torus (g= 1), yield diffusion coefficients D, = 0.0023 um*s* 
and D, = 0.0034 1m? s_' measured normal and parallel to mo, respect- 
ively. Being oriented with respect to mo (Fig. 4d), particles also experi- 
ence angular thermal fluctuations (Fig. 4b) with (A0’) of angular 
displacements (MSDg) initially increasing linearly with the lag time 
t and then saturating as a result of the elasticity-mediated alignment. 
The width of the histogram distribution of the angle 0 between the axis 
of revolution of the solid torus and no (Fig. 4b, inset) is 9.6 x 10° > rad. 
The lateral diffusion of these colloids along directions perpendicular 
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to mo in homeotropic nematic cells is isotropic (Fig. 4c). However, 
when characterized in the particle’s body frame, diffusion of g> 1 
handlebodies is anisotropic because of their shape. For example, 
g=2 particles diffuse more easily along their long axis a (Fig. 4c) 
crossing the centres of the two holes than along the short axis b 1 a 
(ref. 27). The average diffusion of g-handlebodies having the same 
diameter of rings decreases with increasing g (Fig. 4c). While being 
elastically trapped in the vicinity of the handlebodies, defects accom- 
panying the particles also undergo thermal fluctuations. 

In addition to laser tweezing and local melting, the relation between 
defects in n(r) and x can also be probed by applying an electric field 
E (Fig. 4d) that causes the rotation of n towards E as a result of 
the liquid crystal’s positive dielectric anisotropy. Two types of response 
have been observed. When E is increased continuously, colloidal 
handlebodies reorient while preserving their alignment with respect 
to the director and following its reorientation towards E normal to the 
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Figure 4 | Diffusion and electric control of colloidal handlebodies in a 
nematic liquid crystal. a, MSD of a solid torus parallel (blue circles) and 
perpendicular (red circles) to mg versus t in a planar nematic cell of thickness 
d= 16 um. The inset shows the corresponding experimental displacement 
histograms (red and blue filled circles) at t = 0.067 s, and Gaussian fits (red and 
blue lines). b, Plot of MSDg of a colloidal torus against t and a histogram of 
angular orientations with respect to mp (inset). c, MSDs of g= 1 and g= 2 
handlebodies in a homeotropic nematic cell of d = 3 um along x, y, a and b axes; 
solid lines are linear fits to the data. Note the decreased mobility of 
handlebodies due to the thin cell confinement. d-f, PM textures of a planar 
nematic cell of d = 17.5 um in the vicinity of a colloidal solid torus at no field 
(d), after continuous increase of applied alternating-current voltage to 
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U=11V (1kHz) within about 10s (e), and after voltage U = 14.8 V was 
applied to the cell abruptly (as a square pulse modulated at 1 kHz) and then 
switched off (f). The image in f was taken at no applied field and after the torus 
and n(r) had relaxed to the long-lived metastable state with the torus parallel to 
No. g, Diagram of n(r) (black lines) in a texture shown in f. h, i, Bright-field 
images of a torus in a planar nematic cell of d = 17.5 um reorienting with the 
liquid crystal director under the applied field E normal to the image. j, Diagram 
showing U-controlled n(r) deformations and rotation of the torus in the 
vertical yz plane. k, Plot of torus tilt angle / against U; the insets show 3PEF-PM 
cross-sectional images along the green dashed line marked in h at 
corresponding U. 
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substrates (Fig. 4e). However, because of slow rotation of the ring 
compared with the roughly 10-ms response time of n(r), an abrupt 
application of Esimply alters n(r) around the particle while preserving 
the initial particle alignment in the cell. For a solid torus (g = 1), this 
causes the original n(r) to transform into a topologically equivalent 
configuration with two disclination loops (Fig. 4f, g). Using different 
voltage-driving schemes, colloidal handlebodies and structures around 
them can be switched between the two bistable orientations and n(r) 
configurations shown in Fig 4d, f that are stable at no applied field. All 
observed transformations of n(r) and orientations of colloids are again 
found to satisfy the relation }> mj = +7/2. 


Our study experimentally supports the procedure for assignment 
of signs of topological defects in three-dimensional n(x) textures before 
their summation that requires a global point of reference, a ‘base point’, 
that serves as a global choice for the overall sign of n(r), held fixed 
during any smooth deformation of the director complexion. Once 
fixed, the non-polar director field can be decorated with a vector, 
and the use of vector-field lines allows us to assign unambiguous signs 
to the defects (Fig. 3a-f)*’. Although used for many decades", the 
convention that all hyperbolic point defects and disclination loops of 
—1/2 strength have charge ‘“— 1’ whereas all radial defects and disclina- 
tion loops of +1/2 strength have hedgehog charge ‘+1’ fails to pro- 
perly describe the topological charge conservation of hedgehog 
charges in the studied three-dimensional textures. The base point 
and the use of vector field lines in the liquid crystal texture until the 
signs of the hedgehog charges are assigned with respect to this base 
point allow a proper addition of hedgehog charges to the net charge of 
+y/2. The signs of topological point defects and the entire n(r) struc- 
ture induced by handlebody colloids then depend on the direction of 
vector field lines at the base point and can be reversed by flipping this 
direction to an opposite one, because of the non-polar nature of 
nematic liquid crystals*’. It is only through the use of a base point that 
defects in different places can be added together like charges so that the 
net topological charge is conserved. Our approach also describes topo- 
logical charge conservation in liquid-crystal textures studied previ- 
ously, as we show in the Supplementary Information with an example 
of a colloidal dimer and surrounding n(r). One can assign and add 
hedgehog charges in n(r) of samples with multiple colloidal particles 
having the same or different y, and the addition of each separate particle 
always contributes a net +7/2 = —m, to the topological charge distri- 
bution of particle-induced bulk defects compensating for m, and ensur- 
ing charge conservation. However, charges induced by particles can 
have opposite signs even within the same texture, thus enabling charge 
annihilation in the textures surrounding these particles, as we show in 
the Supplementary Information with an example of colloidal dimers. 

We have designed and fabricated topologically distinct handlebody- 
shaped colloidal particles and explored the interplay between the topo- 
logy of colloids and the defects that they induce in a uniformly aligned 
liquid crystal. These handlebody colloids are accompanied by topological 
defects with the net hedgehog charge always equal to half of the Euler 
characteristic of the particle surface. Topological colloids and the estab- 
lished procedure for the assignment and summation of topological 
charges in liquid crystals will enable basic studies of topological manifolds 
and the interplay between particle topology and order and disorder” 
with these model systems. Beyond the exploration of the topology of 
colloids, fields and defects, the experimental arena we have developed 
may enable the design of topology-dictated elastic colloidal interactions 
and reconfigurable self-assembly in liquid crystals’*, the entrapment 
and scaffolding of nanoparticles by particle-induced defects”, the self- 
assembly of reconfigurable topological memory devices", and electro- 
optic and photonic devices based on bistable switching between different 
states with distinct director configurations and orientations of particles. 


METHODS SUMMARY 


Fabrication of silica (SiO) particles with handlebody topology 
involved the following procedure. First, a 90-nm sacrificial layer of 
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aluminium was sputtered on a silicon wafer. Next, a 1-m silica layer 
was deposited on the aluminium by plasma-enhanced chemical 
vapour deposition. Photoresist AZ5214 (Clariant AG) was spin-coated 
on the silica layer. The pattern of rings was defined in the photoresist 
by illumination at 405 nm with a direct laser-writing system (DWL 
66FS; Heidelberg Instruments) and then in the silica layer by induc- 
tively coupled plasma etching. Finally, the photoresist was removed 
with acetone and the aluminium was wet-etched with sodium hydro- 
xide aqueous solution so that the handlebody particles were released 
and then re-dispersed in deionized water (Supplementary Fig. 1). To 
define perpendicular boundary conditions for n(r) on the surface of 
particles, they were treated with an aqueous solution (0.05wt%) of 
N,N-dimethyl-N-octadecyl-3-aminopropyl-trimethoxysilyl chloride 
(DMOAP) and then re-dispersed in methanol. After the addition of 
pentyl cyanobiphenyl and the evaporation of methanol at 70 °C over- 
night, the ensuing nematic dispersion was infiltrated into cells com- 
posed of indium-tin-oxide (ITO)-coated glass plates separated by 
glass spacers defining the cell gap. Cell substrates were treated with 
DMOAP to achieve perpendicular mp or coated with polyimide PI2555 
(HD Microsystems) for in-plane alignment of mo defined by rubbing. 
Optical manipulation and three-dimensional imaging of samples were 
performed with an integrated setup of holographic optical tweezers 
and 3PEF-PM (Supplementary Fig. 2)”*°° built around an inverted 
microscope IX 81 (Olympus) and using a 100% oil-immersion objec- 
tive (numerical aperture 1.4). Holographic optical tweezers used a 
phase-only spatial light modulator (Boulder Nonlinear Systems) and 
an ytterbium-doped fibre laser (IPG Photonics) operating at 1,064 nm. 
3PEF-PM employed a tunable (680-1,080 nm) Ti-sapphire oscillator 
(Coherent) emitting 140-fs pulses at a repetition rate of 80 MHz, and a 
photomultiplier tube detector H5784-20 (Hamamatsu)”’. 
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Large-scale nanophotonic phased array 


Jie Sun', Erman Timurdogan', Ami Yaacobi', Ehsan Shah Hosseini! & Michael R. Watts! 


Electromagnetic phased arrays at radio frequencies are well known 
and have enabled applications ranging from communications to 
radar, broadcasting and astronomy’. The ability to generate arbi- 
trary radiation patterns with large-scale phased arrays has long 
been pursued. Although it is extremely expensive and cumbersome 
to deploy large-scale radiofrequency phased arrays’, optical phased 
arrays have a unique advantage in that the much shorter optical 
wavelength holds promise for large-scale integration’. However, 
the short optical wavelength also imposes stringent requirements 
on fabrication. As a consequence, although optical phased arrays 
have been studied with various platforms** and recently with chip- 
scale nanophotonics” ”’, all of the demonstrations so far are restricted 
to one-dimensional or small-scale two-dimensional arrays. Here we 
report the demonstration of a large-scale two-dimensional nano- 
photonic phased array (NPA), in which 64 x 64 (4,096) optical 
nanoantennas are densely integrated on a silicon chip within a 
footprint of 576,1m X 576m with all of the nanoantennas pre- 
cisely balanced in power and aligned in phase to generate a 
designed, sophisticated radiation pattern in the far field. We also 
show that active phase tunability can be realized in the proposed 
NPA by demonstrating dynamic beam steering and shaping with an 
8 X 8 array. This work demonstrates that a robust design, together 
with state-of-the-art complementary metal-oxide-semiconductor 
technology, allows large-scale NPAs to be implemented on com- 
pact and inexpensive nanophotonic chips. In turn, this enables 
arbitrary radiation pattern generation using NPAs and therefore 
extends the functionalities of phased arrays beyond conventional 
beam focusing and steering, opening up possibilities for large-scale 
deployment in applications such as communication, laser detec- 
tion and ranging, three-dimensional holography and biomedical 
sciences, to name just a few. 

An NPA, resembling its radiofrequency counterpart, consists of 
several typically identical optical antennas. Each antenna emits light 
of a specific amplitude and phase to form a desired far-field radiation 
pattern through interference of these emissions. The short wavelength 
of light offers potential for NPAs to greatly exceed the number of 
elements found in their radiofrequency counterparts, with thousands 
or even millions of elements in a compact, low-cost chip. By incorpo- 
rating a large number of antennas, high-resolution far-field patterns 
can be achieved and therefore arbitrary radiation patterns can be 
generated by an NPA, extending the functionalities of phased arrays 
well beyond the conventional beam focusing and steering. However, 
the short optical wavelength also presents challenges in realizing 
coherent outputs from such large-scale NPAs because even nanoscale 
dimensional fluctuations affect the optical emission from the thousands 
of nanoantennas that need to be balanced in power and aligned in 
phase to form a specific far-field radiation pattern. As a consequence, 
all chip-based two-dimensional NPAs demonstrated so far are limited 
to small-scale implementations with no more than 16 antennas, and 
their functionalities are thereby constrained to conventional single- 
beam focusing and steering. 

Here we report the demonstration of a large-scale, compact NPA sys- 
tem that is compatible with a complementary metal-oxide-semiconductor 


(CMOS) process, consisting of 64 X 64 optical nanoantennas on a 
silicon chip with all 4,096 optical nanoantennas balanced in power 
and aligned in phase to produce a sophisticated radiation pattern, the 
MIT logo, in the far field. To our knowledge, this demonstration 
represents the largest coherent combination of nanophotonic ele- 
ments so far. It also shows that despite the short optical wavelength 
and corresponding length of the phase elements, the phase of the 
elements can be maintained, highlighting the ability to make arbitrary 
manipulations of the phase of an optical field within a nanophotonic 
chip. In addition, we demonstrate that active phase tunability can be 
successfully implemented in the proposed NPA structure to generate 
dynamic far-field patterns. 

The structure of the 64 x 64 NPA is illustrated schematically in 
Fig. la, and Fig. 1b shows a scanning electron micrograph (SEM) of 
the fabricated NPA. A laser input is coupled into the main silicon bus 
waveguide through an optical fibre, and then evanescently coupled 
into 64 row waveguides. The coupling to the row waveguides is con- 
trolled in such a way that each row waveguide obtains the same 
amount of power by varying the length of the directional coupler 
(Supplementary Fig. 1). The optical power in each row waveguide is 
then similarly divided into 64 optical antenna units so that all 4,096 
optical nanoantennas are uniformly excited. 

Each of the 4,096 optical antenna units, or ‘pixels’, consists of two 
major functional parts (Fig. la, inset): a nanoantenna to efficiently 
deliver optical power to free space, and two optical phase delay lines 
to accurately adjust the phase of the light arriving at the nanoantenna 
(Supplementary Fig. 1). Figure 1c shows the SEM of the fabricated 
pixel. Ideally, the pixel size should be less than half of the free-space 
wavelength /, of the optical emission in both the x and y directions for 
the radiation to have a unique interference pattern in the far field 
without high-order radiation lobes'*™. It is therefore crucial to make 
the pixel as small as possible, so as to decrease the number of high- 
order interference patterns in the far field. A compact and efficient 
optical nanoantenna is thus demanding for the NPA system. Optical 
nanoantennas down to subwavelength size have previously been 
demonstrated’*"” with metal; however, to make the material systems 
compatible with the CMOS process, silicon-based dielectric nano- 
antennas are used (Fig. 1c). The compact grating-based optical nano- 
antenna measures 3.0 Lm in length, 2.8 jum in width, and 0.22 um in 
thickness, consisting of only five grating teeth. A partial etch 0.11 um in 
depth is applied to the first grating groove, to break the up-down 
symmetry of the nanoantenna for more upward emission”®. Figure 2a 
shows the simulated near-field emission pattern from a three- 
dimensional finite-difference time-domain simulation, in which 51% 
of the optical power emits upwards while only 30% emits downwards 
at Ag = 1.55 uum. Figure 2b illustrates the calculated far-field pattern of 
the optical nanoantenna, using the near-to-far-field transformation”. 
The emission is not vertical because the grating period is slightly 
detuned from that of a second-order grating that would emit vertically, 
to suppress the resonant back-reflections; otherwise, significant reflec- 
tion would interfere with the light propagation in the phased array. The 
emission from the nanoantenna is also broadband, with a bandwidth 
extending across hundreds of nanometres in wavelength, an inherent 
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Figure 1 |The NPA system. a, Schematic illustration of a 64 X 64 NPA 
system. Laser input from an optical fibre is delivered equally to each of the 4,096 
nanoantennas through silicon waveguides. The inset shows a diagram of a 
close-up view of one antenna unit cell (m, n), or ‘pixel’. The coupling efficiency 
is varied by the length of the directional coupler L,, and the emitting phase is 
controlled by two segments of optical delay lines to achieve a desired phase 


characteristic of the short length of the grating (see also Supplemen- 
tary Fig. 2). 

As mentioned above, the ability to integrate a large number of pixels 
in the NPA within a small footprint opens up the possibility of using 
NPA to generate arbitrary, sophisticated far-field radiation patterns. 
The far-field radiation field E(0,@) of the phased array is calculated as 
the far field of an individual nanoantenna S(0,) (Fig. 2b), multiplied 
by the array factor F,(0,)? 


E(0,9) = S(0,6) X F.(0,6) (1) 


where the array factor is a system factor that is related to the phase of 
optical emission from all the pixels. 0 and ¢ are the far-field azimuth 
angle and polar angle, respectively. By assigning the optical phase Qn, 
of each pixel (where m and n are the pixel indices) in the NPA, the 
desired radiation pattern E(0,~) can be achieved. The phase 9, of 
each pixel can be determined by antenna synthesis through the 
Gerchberg-Saxton algorithm” (Supplementary Fig. 3). The back- 
ground pattern in Fig. 2c shows the simulated array factor pattern of 
the 64 X 64 NPA at a wavelength of 1.55 jim, designed to generate the 
MIT logo in the far field. The pixel pitch is chosen to be 9 um in both 
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Figure 2 | Device and system simulations. a, Three-dimensional near-field 
emission simulation of the designed optical nanoantenna using the finite- 
difference time-domain method. b, The corresponding far-field radiation 
pattern of the optical nanoantenna, calculated from the near-field emission (in 
a), using the near-to-far-field transformation. c, The radiation pattern of the 
designed 64 X 64 NPA system to generate the MIT logo in the far field, which is 
a superposition of the far field of the system’s array factor (as shown in the 
background) and that of the nanoantenna (in b). The green circle in the centre 
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delay of Qn. b, SEM of part of the 64 X 64 NPA system fabricated at a CMOS 
foundry. c, A close-up SEM of one pixel of the NPA system, indicated by the 
green rectangle in b. The pixel size is 9 jm 9 tm, with a compact grating as an 
optical nanoantenna, where the first groove of the grating is partly etched to 
enhance the upward emission. The emitted phase of each pixel is adjusted by 
the length of the optical delay line within the pixel. 


the x and y directions, as will be used in fabrication. The pixel pitch is 
a multiple of the free-space half-wavelength. The designed pattern 
is therefore replicated in the far field, as shown by the background 
pattern in Fig. 2c, because the same interference conditions occur 
periodically in the far field to produce higher-order patterns. Equa- 
tion (1) indicates that the final far-field radiation pattern of the NPA 
is a multiplication of the far field of the individual nanoantenna S(0,¢) 
(Fig. 2b) and that of the array factor F,(0,) (background pattern in 
Fig. 2c), as shown in Fig. 2c. It is seen that the designed pattern is 
only visible in the vicinity of the zenith, owing to the directional 
emission of the optical nanoantenna. Figure 2d shows a close-up view 
of the far field radiation pattern around the zenith, where only the 
top rows have enough intensity to be observed. 

The simulation shows that, in principle, arbitrary radiation patterns 
can be produced in the far field with large-scale NPAs, by controlling 
the emitted phase of all the pixels correctly. However, observing the 
short optical wavelength (1.55 um) and the high refractive index of 
silicon (n = 3.48), it is almost impossible to provide every pixel exactly 
with the designed phase Q,,,,, because slight fabrication imperfec- 
tions will cause significant phase errors. As a consequence, only if 
the designed NPA system is highly resistant to phase errors can 
this large-scale NPA be fabricated reliably and function properly. 


indicates the viewable region in a microscope lens (numerical aperture = 0.4) 
used later in the experiment (Fig. 3a). d, A close-up view of the viewable region 
of the far field displaying the MIT logo. The inset on the lower right shows the 
targeted MIT logo pattern. The radiation fields in b-d are viewed from the 
zenith of the far-field hemisphere, as a projection of the far-field hemisphere to 
the equatorial plane in the polar coordinate system (0,(). 0 and @ are the far- 
field azimuth angle and polar angle, respectively. 
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Fortunately, the designed large-scale NPA system is indeed highly 
tolerant of phase errors (Supplementary Fig. 4). This high phase-error 
tolerance of the designed NPA system originates from its nature as a 
Fourier system, in which the phase noise of the near-field emission 
averages out in the far field through interference of optical emissions 
from all the pixels. This high phase-error tolerance therefore becomes 
more effective with more pixels and will thus enable NPAs to scale up 
beyond 64 x 64, even to millions of pixels. 

The robustness of the designed NPA demonstrates such a system 
can be fabricated reliably. The designed 64 * 64 NPA was fabricated in 
a 300-mm CMOS foundry by 193-nm optical immersion lithography 
(Methods). The 4,096 pixels were accurately reproduced, as shown by 
the SEMs in Fig. 1b, c. A 1.55-j1m laser was coupled into the transverse 
electric mode of the main bus waveguide. Then the near field and far 
field images were captured with an infrared camera, using a setup 
(Fig. 3a) similar to that in ref. 25. Figure 3b shows the near-field image 
of the fabricated NPA system when light was coupled in, where uni- 
form optical emission was observed across all 4,096 nanoantennas. 
Figure 3c shows a close-up view containing 8 X 8 pixels. It is crucial 
for an NPA to have all the pixels emit with a desired amplitude pattern, 
which is uniform here, to provide an ideal interference condition in 
the far field. Figure 3d quantitatively analyses the uniformity of the 
optical emission by measuring the near-field emission intensity of 
each pixel with the infrared camera. A high degree of uniformity is 
observed. The top-left corner of the NPA is noisier, because it is close 
to the fibre coupling point where strong scattering occurs, but this 
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Figure 3 | Experimental results. a, A diagram of the imaging system used to 
observe the near field and far field. Lens 1 alone (numerical aperture = 0.40) 
was used to obtain the near-field (NF) image, as shown by the green rays. The 
far-field (FF) image, or the Fourier image, was taken by moving lens 1 down so 
as to form the far-field image in its back-focal plane (Fourier plane), and 
inserting lens 2 to project the far-field image on the infrared charge-coupled 
device (CCD), as shown by the blue rays. b, The near-field image shows 
uniform emission across all of the 64 X 64 (4,096) nanoantennas. The input bus 
waveguide is located on the top left corner, causing some excess scattering 
noise. c, A close-up view of part of the near field, containing 8 X 8 pixels. 

d, Measured intensity distribution of the optical emission from the pixels. The 


LETTER 


does not reflect non-uniformity in the array itself and can readily be 
addressed with a larger separation from the fibre input. 

Figure 3e shows the measured far-field radiation pattern of the 
fabricated 64 X 64 NPA. The image reveals that the designed sophis- 
ticated radiation pattern (MIT logo) is generated in the far field. The 
far-field image is clamped by the finite numerical aperture (0.4) of 
lens 1. This is also predicted by simulations, as shown by the green 
circle in Fig. 2c, d, where only emission with a small divergence angle 
from vertical can be captured. The intensity noise in the background 
of the far-field image comes from the light scattering caused by fibre- 
to-waveguide coupling. The scattered light is also responsible for the 
concentric fringes in the background, through the interference of 
the scattered light between the top and bottom surfaces of the sil- 
icon-on-insulator wafer. This noise can be removed by placing the 
fibre-waveguide coupler farther from the NPA system to reduce the 
light scattering captured by the imaging column, and a much cleaner 
far-field radiation pattern would be expected. Figure 3f shows the 
captured radiation pattern of a 32 X 32 NPA on the same chip, in 
which less noise is observed because it is placed a little farther from 
the fibre coupling point; however, the far-field pattern resolution is 
reduced because it contains fewer pixels. The measured images agree 
perfectly with simulations in Fig. 2c, d, in terms of the shape of the 
pattern (MIT logo) and the relative intensity of all interference orders, 
highlighting the robustness of the NPA design and the accuracy of 
the fabrication. Comparing Fig. 3b with Fig. 3e, the near-field image 
of the NPA contains no information because it shows plain uniform 
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statistics show that the standard deviation (c) of the emission intensity is 13% of 
the average intensity (1). e, The far-field radiation pattern of the fabricated 
64 X 64 NPA system. The aperture (indicated by the green circle) corresponds 
to the numerical aperture (0.4) of lens 1 in a. The measurement accurately 
matched simulations in Fig. 2c, d, except for the higher background noise 
introduced by the light scattered from fibre-to-waveguide input coupling. 

f, The far-field radiation pattern of a 32 X 32 NPA on the same chip. Less noise 
but lower resolution is observed. The concentric ring pattern superimposed in 
the image is caused by the interference of the scattered light between the top and 
bottom surfaces of the silicon-on-insulator wafer. 
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emission everywhere; however, the far field explicitly delivers the 
information—an image with the MIT logo. Until now, image informa- 
tion has generally been stored and transmitted through the intensity 
of the pixels; in contrast, this large-scale NPA technology opens 
up another dimension for imaging: the image information is now 
encoded in the optical phase of the pixels, much like a hologram, 
but generated from a single point. This demonstration, as a static 
phased array capable of generating truly arbitrary radiation patterns, 
would also find immediate applications in, for example, complex 
beam generation’® and mode matching in optical space-division 
multiplexing”. 

By comparison with other holographic approaches such as the 
metasurface antennas’’, the NPA allows separate control over the 
phase and amplitude of light emission and on-chip single-point excita- 
tion of the nanophotonic emitters, enabling truly arbitrary holograms 
to be generated entirely on-chip for the first time; moreover, benefiting 
from the use of guided light in silicon instead of free-space light, 
active manipulation of the optical phase can be directly implemented 
to achieve dynamic far-field patterns with more flexibility and wider 
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Figure 4 | Tunable phased array. a, Schematic illustration of the 8 X 8 active 
phased array. Inset, diagram of an active pixel with the same size (9 [tm X 9 Jum) 
as the passive pixel in Fig. la. The optical phase of each pixel is continuously 
tuned by the thermo-optic effect through an integrated heater formed by doped 
silicon. b-f, Examples of the dynamic far-field patterns generated by the 8 X 8 
active phased array by applying different voltage combinations to the pixels, 

showing simulations and measurements. b, The original single-beam pattern 
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applications, by converting the pixel into a thermally phase-tunable 
pixel in a CMOS process (Fig. 4a, inset). A portion of the silicon light 
path in each pixel is lightly doped with an n-type implant to form a 
resistive heater for thermo-optic phase tuning while maintaining a low 
loss of light propagation. Two narrow silicon leads with heavy n° - 
doping, providing electrical connections to and thermal isolation from 
the heater, are connected to the heater on the inner side of the adiabatic 
bends to minimize the loss caused by light scattering’*”’. Figure 4a 
illustrates an active 8 X 8 NPA in which each pixel has an indepen- 
dently tunable phase shifter; the electrical controls are connected in 
rows and in columns to simplify the electrical circuitry. This active 
8 X 8 phased array was fabricated with CMOS-processing techniques 
(Methods). The measured resistance is 2.5 kQ per heater including the 
two copper-silicon contacts, and a high thermal efficiency of about 
8.5mW per m-phase shift is achieved, benefiting from the direct 
heating of the silicon waveguide. By applying different voltages on 
each pixel, different phase combinations can be achieved in the phased 
array to generate different radiation patterns dynamically in the far 
field, as shown in Fig. 4b-f, in which five different radiation patterns 
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with no voltage on. c, d, The focused beam is steered by 6° to the edge of each 
interference order in the vertical (c) and in the horizontal (d). e, The single 
beam is split into two beams in the vertical direction. f, The single beam is split 
into four beams in the horizontal direction. The green circle indicates the edge 
of lens 1 (numerical aperture = 0.4), and the red box specifies the area of one 
interference order (see also the Supplementary Movie for the dynamic pattern 
generation). 
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are demonstrated using the same phased array: a standard single-beam 
pattern with no voltage on the array; beam-steering for about 6° in 
both the vertical direction and the horizontal direction; generation of 
two beams in vertical direction; and generation of four beams in the 
horizontal direction. Good agreement between simulation and experi- 
ment is again observed, which confirms the robustness of the proposed 
design as well as the accuracy of the fabrication and active thermo- 
optic phase tuning. The proposed active NPA structure can be imme- 
diately extended to a larger phased array (for example 64 X 64, as 
discussed above) with independent electrical control of each pixel with 
the aid of fully CMOS-controlled circuitry to access all of the pixels 
electrically, to project dynamic patterns in the far field with applica- 
tions ranging from communications, three-dimensional hologra- 
phic displays, laser detection and ranging (LADAR) and biomedical 
imaging to interferometry. 

We have demonstrated a large-scale 64 X 64 nanophotonic phased 
array built on a silicon photonic platform. The robustness and CMOS 
compatibility of the designed system enabled the successful integration 
of 4,096 optical nanoantennas within a chip-size footprint, probably 
the largest and densest silicon photonic functional system demon- 
strated so far, and certainly the largest coherent demonstration of 
silicon photonics. Optical power is uniformly distributed in the nano- 
antennas and is phase aligned to produce a sophisticated far-field 
radiation pattern, the MIT logo, as designed. The large number of 
nanoantennas and the embedded phase tunability enable NPAs to 
generate arbitrary far-field radiation patterns dynamically and, in 
turn, to affect new fields such as communication, LADAR, three- 
dimensional holography and biomedical sciences. The ability to take 
advantage of a state-of-the-art CMOS integration process also pro- 
mises a bright future for low-cost and compact NPAs that will be 
important in many fields with their newly explored functionalities. 


METHODS SUMMARY 


The NPAs were fabricated in a 300-mm CMOS foundry with a 65-nm technology 
node, using silicon-on-insulator wafers with a 0.22-u1m top silicon layer and 2 um 
buried oxide. A timed partial silicon etch (0.11 jum) was first performed to make 
the partly etched grating groove. A full silicon etch was then applied to form the 
waveguides and grating nanoantennas. Subsequent n and n’ dopings were 
implanted for active arrays, followed by standard silicidation to make copper- 
silicon contacts. The contacts were connected to on-chip probing pads by two 
metal layers for thermo-optic tuning. SiO, with a total thickness of 3.6 um was 
used to cover the devices, with a final polishing step to make the surface planar to 
avoid additional phase errors due to surface corrugation. 
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Carbon-dioxide-rich silicate melt in the Earth’s 


upper mantle 


Rajdeep Dasgupta’, Ananya Mallik', Kyusei Tsuno', Anthony C. Withers’, Greg Hirth* & Marc M. Hirschmann? 


The onset of melting in the Earth’s upper mantle influences the 
thermal evolution of the planet, fluxes of key volatiles to the exo- 
sphere, and geochemical and geophysical properties of the mantle. 
Although carbonatitic melt could be stable 250 km or less beneath 
mid-oceanic ridges’”, owing to the small fraction (~0.03 wt%) its 
effects on the mantle properties are unclear. Geophysical measure- 
ments, however, suggest that melts of greater volume may be pre- 
sent at ~200 km (refs 3-5) but large melt fractions are thought to be 
restricted to shallower depths. Here we present experiments on 
carbonated peridotites over 2-5 GPa that constrain the location 
and the slope of the onset of silicate melting in the mantle. We find 
that the pressure-temperature slope of carbonated silicate melting 
is steeper than the solidus of volatile-free peridotite and that 
silicate melting of dry peridotite + CO, beneath ridges commences 
at ~180km. Accounting for the effect of 50-200 p.p.m. H,O on 
freezing point depression, the onset of silicate melting for a sub-ridge 
mantle with ~100 p.p.m. CO, becomes as deep as ~220-300 km. We 
suggest that, on a global scale, carbonated silicate melt genera- 
tion at a redox front ~250-200 km deep®, with destabilization of 
metal and majorite in the upwelling mantle, explains the oceanic 
low-velocity zone and the electrical conductivity structure of the 
mantle. In locally oxidized domains, deeper carbonated silicate melt 
may contribute to the seismic X-discontinuity. Furthermore, our 
results, along with the electrical conductivity of molten carbonated 
peridotite’ and that of the oceanic upper mantle’, suggest that man- 
tle at depth is CO2-rich but H,O-poor. Finally, carbonated silicate 
melts restrict the stability of carbonatite in the Earth’s deep upper 
mantle, and the inventory of carbon, H,O and other highly incom- 
patible elements at ridges becomes controlled by the flux of the former. 

Knowledge of mid-oceanic-ridge basalt generation suggests that the 
initiation of silicate melting in an upwelling mantle is restricted to the 
top 70-85 km (refs 8, 9). This constraint is potentially at odds with 
observations of a seismic low-velocity zone that extends as deep as 
200 km (ref. 4). Although the low shear-wave speed of oceanic mantle 
can perhaps be explained by high temperatures", or hydroxyl groups 
in nominally anhydrous minerals'', correlation between velocity 
anomalies and high attenuation’ may require the presence of partial 
melts at depths even in excess of 200 km. Intra-plate upper-mantle 
electrical conductivity, with a possible anomaly at ~220 km depth*”, 
may also require the presence of a highly conductive melt. Yet silicate 
melts are not thought to be generated at depths approaching or greater 
than 200 km. 

If Earth’s upper mantle is locally oxidized at depth, then carbonatitic 
melt could be generated atop the transition zone in upwelling mantle 
beneath ridges. However, with =100p.p.m. CO, in the mantle, 
only =0.03 wt% of carbonatitic melt is produced and the effect of such 
a small melt fraction on mantle physical properties is unclear. 
Moreover, mantle at depths =250 km, on a global scale, is shown to 
be metal-saturated®, thereby destabilizing a carbonatitic melt. Redox 
melting probably begins at 250-200 km depth in upwelling sub-ridge 
mantle, but the nature of generated melt at this redox front—that is, 


carbonate-rich versus silicate-rich—remains unknown. Dissolved 
CO, + H20 may stabilize a silicate melt at much lower temperatures 
than the volatile-free peridotite solidus, but previous work only con- 
strained the stability of carbonated silicate melt in simplified systems 
(CaO-MgO-Al,03-SiO2 (CMAS) + CO, (see, for example, ref. 13) or 
CaO-MgO-FeO-SiO, + Al,O3 + CO) system (ref. 14)) or at a single 
pressure of 3 GPa (ref. 15) for a natural peridotite system. The key 
information that remained unconstrained was the location and the 
pressure-temperature P-T slope of the onset of carbonated silicate 
melting of natural fertile peridotite; that is, the slope of the melt CO, 
isopleths and carbonate-silicate melt transition in P-T space. 

We conducted partial melting experiments with nominally anhydrous, 
carbonated fertile peridotite at temperatures between 1,300 and 
1,750 °C and pressures between 2 and 5 GPa to constrain the pressure 
dependence of mixing between near-solidus carbonatite and higher- 
temperature silicate melts. Experiments were conducted with a piston- 
cylinder press (at a pressure of 2 GPa) and a Walker-style multi-anvil 
press (at pressures of 4 and 5 GPa) (Methods). A fertile peridotite with 
2.5 wt% CO, (PERC) and 1 wt% CO, (PERC3) (Supplementary Table 
1)’ was used as the starting material. 

Olivine (Ol), orthopyroxene (Opx) and carbonated partial melts 
(c-SL) are present in all the experiments in the presence or absence 
of garnet (Gt) and clinopyroxene (Cpx) (Fig. la and Supplementary 
Table 2). Near the solidus, the melt is carbonatitic (<10 wt% SiO, 
and 38-45 wt% CO, dissolved in the melt) with melt fractions F of ~5- 
6 wt% (PERC) or ~2-3 wt% (PERC3) that are directly proportional to 
the bulk CO, contents (F= C29, /Céo,, assuming perfect incompat- 
ibility of carbon in residual silicates, where C2.4,and Cé,,are the con- 
centrations of CO, in the mantle source and liquid, respectively). 
Carbonatitic melt evolves to a carbonated silicate melt (>25 wt% 
SiO, and =25 wt% CO), dissolved in the melt; Supplementary Table 
3) with increasing temperature as the dissolution of Cpx and/or garnet 
into carbonatitic fluid becomes significant, melt fraction increases and 
Cho, gets diluted. 

The key observation of our experiments is that the temperature of 
onset of carbonated silicate melting, that is, stabilization of a melt with 
~25 wt% COs, is significantly lower than the solidus of volatile-free 
peridotite’® at the pressure range investigated. The transition from 
carbonatite to silicate melts in our study also occurs at a ~200 °C lower 
T than the similar transition observed in the simple CMAS + CO, 
system’, suggesting a crucial function for TiO, Na,O and FeO in 
stabilizing carbonated silicate melts. Our results also demonstrate 
that the temperature difference between the temperature—melt frac- 
tion T-F trends defined for the volatile-free peridotite and carbonated 
systems increases with increasing pressure between 2 and 5GPa 
(Fig. 1b). Thus, at greater depths the onset of silicate melting (dissolu- 
tion in a carbonatite) takes place at progressively lower temperatures 
with respect to the volatile-free peridotite solidus. We interpret the 
observations as indicating that this feature is caused by the diminishing 
influence of non-ideal mixing between the silicates and carbonates 
with increasing pressure; that is, increasing the separation of the 
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Figure 1 | Experimental conditions, phase assemblages, melt CO, isopleths 
and extent of melting as a function of temperature. a, Pressure-temperature 
plot. b, Temperature-melt-fraction plot. Results are shown from this (at 2 GPa 
(blue), 4 GPa (green) and 5 GPa (red)) and previous (at 3 GPa (grey)’”) studies. 
The grey lines in a show the modelled melt CO, isopleths for carbonated silicate 
melts between the carbonated peridotite solidus*”* and the CO,-free peridotite 
solidus'®; numbers along the lines are percentages by weight of CO) in the melt. 
The segment of the carbonated peridotite solidus deeper than ~250 km is 
relevant only for locally oxidized domains of the mantle. The inset to a shows a 
BSE image of a typical experimental sample with residual Ol, Opx and Gt and 
quenched carbonated silicate melt contained in graphite (Gr) capsule. Also 
shown for comparison are the solid mantle adiabats relevant for oceanic ridges. 


carbonated silicate melt liquidus from the metastable miscibility gap 
between carbonate and silicate melts (Supplementary Fig. 1). Indeed, 
the plot of carbonated silicate melt compositions (Supplementary 
Table 3), expressed in terms of molar CO2/(CO2+SiO3), as a function 
of pressure suggests that the transition from carbonate melt to silicate 
melt is sharp at lower pressures (for example, 2 GPa) but becomes 
more gradual with increasing pressure to 5 GPa (Fig. 2), which is 
consistent with observations in simplified and hydrous carbonated 
systems'*'”. Owing to higher bulk CO), many of our peridotite partial 
melting experiments produce Cpx-free and some Gt-free assemblages. 
However, additional experiments at conditions appropriate for the 
carbonatite-silicate melt transition for peridotite (that is, 3 GPa and 
1,375 °C) on more fertile bulk compositions (carbonated silicate melt 
fluxed peridotite) reveal that the mixing behaviour between carbonate- 
rich and silicate-rich melt depends little on peridotite bulk com- 
position. Carbonate-silicate major-element mixing, as revealed by 
SiO,-CO, and CaO-COQ) systematics of partial melts in all our experi- 
ments, is similar to that for partial melts that are saturated with 
Ol + Opx + Cpx + Gt of variable compositions from this and pre- 
vious studies (Supplementary Information and Supplementary 
Fig. 3). 

Our experiments show that, in the presence of CO;, appreciable 
melting or dissolution of mantle silicates at greater depths begins 
at temperatures significantly below the volatile-free solidus (Fig. 1b). 
However, Earth’s mantle beneath oceanic ridges contains only 50- 
100 p.p.m. CO, (refs. 18-20), and the extent of CO-induced melting 
is proportional to source CO}, Cio, (ref. 15). Thus, to extrapolate our 
experimental data to natural mantle peridotite and to estimate the 
extent of silicate melting (F) induced by trace CO2, we parameterize 
(following the approach of ref. 15) the temperature difference (AT) at 
a given melt fraction between the T-F trends for volatile-free perido- 
tite (Supplementary Information) and for carbonated peridotite (this 
study, and ref. 15) as a function of CO, content of the experimental 
melts, Ck, (Fig. 3). Figure 3 shows that at any given melt CO, content, 
AT increases with increasing P up to 5 GPa. If we assume that all the 
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In b, temperature—melt-fraction data from carbonated peridotite (curves with 
points; 2.5 wt% bulk CO,; this study and ref. 15) are compared with similar data 
in CO)-free systems (straight lines) from the literature (see Supplementary 
Information). For a given melt fraction, carbonated silicate melts are stabilized 
at a lower temperature than silicate melts from a volatile-free peridotite, and 
that temperature difference (for a constant CO, bulk composition) increases 
with increasing pressure from 2 to 5 GPa (marked by AT(P)). The fit to the data 
for carbonated silicate melt fraction versus temperature is based on AT(P) 
versus melt CO, contents (Fig. 3; Supplementary Information). The melt CO, 
isopleths in a are based on AT(P) at each pressure from 2, 3, 4 and 5 GPa. For 
clarity, the data for lwt% bulk CO, composition are not shown. Errors in b are 
lo with respect to the mean. 
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Figure 2 | Temperature-composition diagram showing evolution of melt 
composition. Melt compositions are expressed in terms of molar CO/ 

(CO, + SiO3), asa function of reduced temperature (T — Tyolatite-free peridotite solidus) 
for carbonated peridotite with 2.5 wt% (squares; this study and ref. 15) and 

1 wt% CO, (diamonds; this study). The transition from carbonatite to 
carbonated silicate melt is sharp at lower pressures (=3 GPa) but more gradual 
at higher pressures (=4 GPa). The onset of carbonated silicate melting 
(arbitrarily placed at a molar CO2/(CO2 + SiOz) of ~0.6—that is, a melt CO2 
content of ~25 wt%)—occurs at a lower temperature with respect to the 
volatile-free peridotite solidus at higher pressures. Compositional errors are lo. 
Reduced temperature uncertainties are <25 °C and are not plotted. 
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Figure 3 | Plot of AT as a function of concentration of CO, in the partial 
melts, Cs . ATis the difference between the T-F trends of CO,-free peridotite 
and those of CO-bearing peridotite. Results from this study (squares, 2.5 wt% 
bulk CO,; diamonds, 1 wt% bulk CO;) and from ref. 15 are shown. At a given 
Cho», AT increases with increasing pressure from 2 to 5 GPa, suggesting a larger 
influence of CO; on silicate melting at greater depth. Inset: the modelled C&, 
as a function of reduced temperature (Fig. 2) at 2, 3, 4 and 5 GPa for p.p.m. level 
bulk CO. Errors are 1¢ with respect to mean Cg). 


Melt fraction (wt%) 


LETTER 


carbon is dissolved in the melt, the Cho -AT(P) relations in Fig. 3 
allow us to estimate T — Choy and T — F (F= Choo! Cha) trends at 
each pressure and for any given C29). This analysis shows that trace 
carbonated silicate melting in the Earth’s upper mantle begins at 
~120°C below the volatile-free peridotite solidus at 2 GPa, ~160°C 
below the solidus at 3 GPa (ref. 15), ~210°C below the solidus at 
4 GPa, and ~280 °C below the solidus at 5 GPa. 

We determined both the slopes of Ck, isopleths in P-T' space 
(Fig. 1) and the depth-melt fraction, depth-C&,, trends for astheno- 
spheric mantle with trace CO, (Fig. 4). Our model shows that a dry 
silicate melt with 25 wt% dissolved CO, is stabilized as deep as ~180 km 
for a sub-ridge mantle with a potential temperature Tp of 1,350 °C 
(145 km for 1,315 °C and 230 km for 1,400 °C Tp) and that 100 p.p.m. 
CO, alone can induce silicate melting with a melt fraction as much as 
~0.07% at ~150 km depth beneath ridges. In contrast, previous esti- 
mates predicted a carbonatite melt fraction of only 0.02% at a similar 
depth’’. Asthenospheric mantle, however, also contains trace quan- 
tities of an H,O component; given the incompatible behaviour of 
HO during mantle melting’, the presence of H,O should deepen 
the onset of carbonated silicate melting'’, enhance the extent of 
melting at a given depth’®, and decrease Ck,,. To incorporate the 
effect of HO, we use the cryoscopic equation” with depth-dependent 
peridotite-melt partition coefficients for HzO, Dyno (ref. 9), by taking 
into account the possible effect of dissolved carbonate in the melt on 
Dy20 (Supplementary Information). Our calculations suggest that 
with 100 p.p.m. CO, and 50-200 p.p.m. H,O in the sub-ridge mantle, 
1-5 wt% H,O can be dissolved into deep carbonated silicate melt with 
~25 wt% CO, and that this concentration of dissolved H,O will cause 
the onset of wet, carbonated melting to be as deep as 220-300 km. As 
much as 0.06-0.1 wt% wet carbonated silicate melt is generated by 
~150 km depth, and 0.25 wt% by 100 km. For peridotite with 300 p.p.m. 
CO, and 300-500 p.p.m. H,0, similar to that in the source of enriched 
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Figure 4 | Melting regime and mantle flow beneath a mid-oceanic ridge 
along a 1,350 °C mantle potential temperature (T,) adiabat. Also shown are 
the extent of equilibrium melting, dehydration and melt composition of 
peridotite. a, The depth of onset of silicate melting (from stippled to hatched 
region) is relevant for a mantle with ~50 p.p.m. H2O and 100 p.p.m. COs. Also 
shown are the silicate solidi of damp (50-200 p.p.m. H,O) peridotite (blue 
band)”, volatile-free peridotite'®, quartz eclogite (recycled oceanic crust)*', and 
dry carbonated peridotite (this study). The transition from carbonatite to 
carbonated silicate melt is = 110-150 km deeper than the solidus of volatile- 
free peridotite. The horizontal ellipses mark the depths of impedance contrast 
in oceanic mantle X-discontinuity** and the grey shaded region marks the 
stability of Ni-Fe metal’. b, Residual peridotite H,O content (thin light blue, 
equilibrium melting; bold dark blue, fractional melting) is shown for two bulk 
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compositions that are at their H,O storage capacity (between 500 p.p.m. 
(dashed) and 700 p.p.m. (solid)) atop the transition zone and have 300 p.p.m. 
CO). The grey shaded region indicates that the curves are metastable owing to 
redox freezing of carbonate component with enhanced stability of Fe** in 
majorite and stability of Ni-Fe metal’. c, Electrical conductivity of the north 
Pacific’ (green shaded region) and Philippine Sea mantle”® (green curve) 
compared with the modelled Ck, and C§o, for fertile peridotite with different 
CO,-H,0O compositions. Solid segments are relevant where redox melting 
defines the solidus at ~250 km depth (blue shaded band). Dashed segments are 
relevant for oxidized domains. Because H,O enhances melting and dilutes 
Ck, if highly conductive melt with >15-30 wt% CO, is required to produce 
the observed electrical conductivity anomaly at 180-250 km depth, the mantle 
needs to be H,O-poor. 
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oceanic basalts****, the equilibrium melt fraction can be as much as 
1.5 wt% at 100 km, even for a mantle with a potential temperature 
similar to that below ridges (Fig. 4b). 

The depth of onset of wet, carbonated silicate melting beneath ridges 
suggests that carbonatite melt stability in the oceanic upper mantle is 
restricted to 2220-300 km (Fig. 4c). However, the deep stability of 
carbonatite on a global scale is probably eliminated, because the depth 
of transition of carbonatite to carbonated silicate melt exceeds the 
depth of metal saturation and diamond stability’® for mantle with 
>50 p.p.m. H,O (Fig. 4c). Our results, along with the previous experi- 
ments on mantle redox state, suggest that the first-formed melt in the 
upwelling mantle beneath ridges, globally, is a carbonated silicate melt 
with a near-solidus melt fraction as much as 0.05-0.06 wt% at ~250- 
220km (Fig. 4b). This melting commences at a redox front, where 
metal and majorite (the main host of Fe**) become unstable’, pro- 
moting the stability of a carbonated silicate melt over diamond, 
carbide, carbon-bearing metal alloy or alloy melt”. Because mantle 
electrical conductivity profiles” require a melt with >15 wt% CO, 
(refs 7, 27), our data also suggest that the sub-ridge mantle at depths 
is CO,-rich but H,O-poor (=200 p.p.m.). Higher HO content in the 
mantle enhances melting and lowers the CO, content (Fig. 4c) and 
the conductivity of such melts. Redox melting at ~250 km can gen- 
erate a carbonatitic melt (~38 wt% CO.) for a dry mantle, and the melt 
CO, content remains high enough (>15 wt% CO2) for a mantle with 
=200 p.p.m. H,O to explain the peak and elevated electrical conduc- 
tivity beneath oceans at ~220 and 100-250 km depth, respectively”. 
Thus, our study helps to reconcile the geophysical observations of the 
oceanic low-velocity zone down to ~200km, the highly conductive 
asthenosphere, and the petrological models of mantle melting and 
redox (Fig. 4). Moreover, the depth for the onset of carbonated silicate 
melting may be similar to the depth of the X-discontinuity (270- 
330km) in oceanic mantle”, if the mantle is locally oxidized. We 
suggest that the appearance of hydrous carbonated silicate melt in 
oxidized mantle might explain the distinct impedance contrast across 
~250-300km that is observed for the seismic X-discontinuity. We 
note that the near-zero dihedral angle of the melt-mantle system at 
these depths implies that even a trace amount of melt will have large 
influence on the physical properties”. 

Our experiments and model further allow us to calculate the HO 
depletion of nominally anhydrous mantle as a function of depth 
beneath ridges (Fig. 4b) by using depth-dependent partition coeffi- 
cients of HO between peridotite and basaltic melt’? (Supplementary 
Information) and the trend in melt fraction against depth for sub-ridge 
mantle. These calculations suggest that, even for the extreme case of 
a mantle that is at its H,O storage capacity atop the transition 
zone (perhaps resulting from hydrous melting across the 410-km 
discontinuity*’), volatile depletion owing to deep carbonated silicate 
melt generation can result a sub-ridge residual mantle with =50 p.p.m. 
H,0 (Fig. 4b). However, the combined data on mantle electrical con- 
ductivity and volatile inventory of MORB suggest that deep mantle 
contains =200 p.p.m. H,O. Finally, deep commencement of carbo- 
nated silicate melting in the Earth’s upper mantle suggests that a 
kimberlitic melt with 20-30 wt% SiO, (Fig. 4c and Supplementary 
Fig. 3), rather than carbonatite, controls the flux of highly incom- 
patible elements, HO and CO) at mid-oceanic ridges through mixing 
with the shallow depleted melts. 


METHODS SUMMARY 


Experiments at high pressure and temperature were performed with piston- 
cylinder and multi-anvil devices to simulate Earth’s oceanic upper-mantle condi- 
tions (2-5 GPa; 1,300-1,750 °C). Starting compositions included natural mineral 
mixes equivalent to fertile mantle peridotite with 1.0 wt% and 2.5 wt% CO, and 
synthetic mixes with variable CO, contents. All starting mixes were dried to 
achieve nominally anhydrous conditions and reduced at an fO, lower than that 
of quartz—fayalite-magnetite buffer. All mixes were contained in graphite capsules 
with a platinum outer liner. Experimental products were analysed with a field- 
emission-gun scanning electron microscope and an electron probe microanalyser 
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for texture and major-element compositions. To obtain representative melt com- 
positions from a quenched melt pool comprising dendrites, a defocused electron 
beam was employed on serially polished surfaces. Mass balance calculations were 
used to estimate phase proportions in all experiments. Dissolved CO) contents of 
melts were estimated by electron microprobe deficits and also using mass balance 
calculations. To quantify the freezing-point depression and the enhanced melt 
production effect of trace CO on silicate partial melting of peridotite, AT, the 
temperature difference between the T-F trend for volatile-free peridotite and 
that for carbonated peridotite (this study and ref. 15) was fitted as a function of 
dissolved CO, of the melt, Chess The AT-Co) trends at each pressure were 
fitted with an empirical function (ref. 15). Using this parameterization, the Ck, 
isopleths in pressure-temperature space were obtained. To obtain the combined 
effect of dissolved CO and H;0O in stabilizing incipient silicate melt below the 
volatile-free peridotite solidus, the freezing-point depression effect of each species 
was added; that is, AT total = AT(Cko)) + AT(Chy9), where both Ck, and Chi 
vary as a function of depth, the first owing to pressure-dependent mixing of silicate 
and carbonate melt components as constrained in this study, and the second 
owing to the pressure dependence of DP@°t"*/™*"" (ref. 9). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


The starting compositions for peridotite + CO, partial melting experiments (PERC 
and PERC3: MixKLB-1 + mixture of carbonates) with 2.5wt% and 1.0wt% bulk 
CO, were the same materials used in refs 2 and 15 (Supplementary Table 1). This 
ensured reliable comparison between present and previous data (ref. 15). The base 
silicate powder was dried and reduced at fo, lower than that buffered by quartz— 
fayalite-magnetite (QFM-1). The mix was then reground under ethanol and CO, 
was added as a mixture of natural and reagent grade (CaCO3, NaxCO3, K2COs) 
carbonates in a manner that kept the Ca:Mg:Fe:Na:K ratio of the base peridotite 
(MixKLB-1) unaffected by CO, addition. This approach of CO, addition has been 
demonstrated to minimize the effect of overabundances of carbonates/CO, on the 
melting-phase relations of peridotite**. To test the nature of carbonate-silicate 
mixing, as captured by the compositions of carbonated silicate melts, as a function 
of variable peridotite compositions, additional experiments were performed on 
more fertile bulk compositions (Supplementary Table 1). These additional starting 
compositions were prepared with reagent-grade oxides and carbonates. All the 
carbonate compounds were fired in a 1-atm furnace at 110-300°C for 4-12h 
before mixing them with silicates under ethanol. Homogeneous mixtures of 
carbonate-bearing peridotite mix were loaded in graphite crucibles with platinum 
outer capsules (3.5 mm outer diameter for piston-cylinder (PC) experiments and 
2.0mm outer diameter for multi-anvil (MA) experiments). The loaded capsules 
were placed in a drying oven at 110°C for a couple of hours before welding. PC 
experiments were conducted at Rice University, and MA experiments were con- 
ducted at the University of Minnesota. The PC and MA experiments followed the 
procedures, assemblies and P-T calibrations described and reported in detail in 
refs 33 and 34, respectively. Pressure uncertainties are believed to be +0.1 GPa for 
PC and +0.3 GPa for MA experiments, and the temperature uncertainties are 
+10 °C for both the devices. The redox states during the experiments were not 
controlled strictly; however, the association of orthopyroxene, carbonated melt, 
olivine, clinopyroxene and graphite/diamond (inner capsule) allowed calculation 
of oxygen fugacity following the buffer reactions EDDOG/EDDOD (reaction 1) 
and/or EMOG/EMOD (reaction 2): 


2MgoSi,0¢ + CaMg(CO3). = CaMgSi,0g + 2Mg,SiO, + 2C +O, (1) 
MgoSix0¢ + 2MgCO3 = 2Mg,SiO4 + 2C + 202 (2) 


Estimated oxygen fugacities based on a recent calibration’’, which takes into 
account the effect of dilution of CO, in the melts on the equilibrium oxygen 
fugacity, are reported in Supplementary Table 2. 

Run products were polished on soft nylon and velvet cloth by using dry poly- 
crystalline diamond powders to achieve a 0.3 um finish. H,O and other liquid 
lubricants were avoided during polishing to minimize damage to the quenched 
carbonate-bearing melt. Textural and wavelength dispersive spectroscopic (WDS) 
phase compositional analysis of the carbon-coated experimental samples was 


performed with a Cameca SX-100 electron probe at NASA Johnson Space 
Center (peridotite + CO, experiments) and a Cameca SX-50 electron probe at 
Texas A&M University (carbonated silicate melt fluxed peridotite experiments). 
Analytical standards included a synthetic basalt glass (Ca, Mg, Fe, Si), an Indian 
Ocean basalt (Ca, Al, Si, Mg), olivine (Mg, Si, Fe), calcite (Ca), dolomite (Mg, Ca), 
Cpx (Ca, Mg, Fe, Si), enstatite (Mg, Si), pyrope (Mg, Al, Si), andradite (Ca, Al, Fe), 
oligoclase/albite (Na, Al), rutile/ilmenite (Ti), rnodonite (Mn), spessartine (Mn), 
potassium feldspar (Al, K) and chromite (Cr). To obtain meaningful average melt 
compositions from heterogeneously quenched dendritic melt pool texture, a 
defocused (5-30-|1m spot size) electron beam was employed. Moreover, to obtain 
as many spot analyses as possible, two to four new longitudinal surfaces of each 
sample were exposed by serial polishing. To avoid sampling melt compositions 
changed by quench modification, only the melt regions that are separated from the 
residual mineral assemblage domains (Fig. 1) were analysed. Dissolved CO con- 
centrations of partial melts were estimated both by the difference between electron 
probe totals and 100wt% and by mass balance calculations (Supplementary Table 
3). Because the two approaches yield similar values (Supplementary Fig. 4), those 
obtained from electron microprobe totals are plotted in all figures. Supplementary 
Table 2 reports the experimental conditions, obtained phase assemblages and 
phase proportions obtained by mass balance calculation. Supplementary Table 3 
reports the major-element compositions of all resulting phases. 

Freezing-point depression and the enhanced melt production effect of CO, on 
partial melting of peridotitic silicates was calculated by fitting AT, the temperature 
difference between the T-F trend for volatile-free peridotite and that for carbo- 
nated peridotite (this study, and ref. 15) as a function of dissolved CO, of the melt, 
C&o)- The AT-C&,, trends at each pressure are nearly linear up to a C&o, of 
~25wt% and were fitted with an empirical function, 


AT(C) =a X Céo9 +b x In[(100 —c x C& 5) /100] 


where Cé,,is in percentage by weight and a, b and c are constants that vary as a 

function of pressure (Supplementary Information). 

To obtain the combined effect of dissolved CO, and H,O in stabilizing 
minute amounts of silicate melt below the volatile-free peridotite solidus, 
we combined the freezing-point depression effect of each species; that is, 
ATrotai = AT(CEo2) + AT(Cfn9)» where both C& ) and Ci, vary as a function 
of depth, the first owing to pressure-dependent mixing of silicate and carbonate 
melt components as constrained in this study and the second owing to the pressure 
dependence of DPSet*/™*" (ref. 9), 

32. Dasgupta, R. & Hirschmann, M. M. Effect of variable carbonate concentration on 
the solidus of mantle peridotite. Am. Mineral. 92, 370-379 (2007). 

33. Tsuno, K. & Dasgupta, R. Melting phase relation of nominally anhydrous, 
carbonated pelitic-eclogite at 2.5-3.0 GPa and deep cycling of sedimentary 
carbon. Contrib. Mineral. Petrol. 161, 743-763 (2011). 

34. Dasgupta, R., Hirschmann, M. M. & Withers, A. C. Deep global cycling of carbon 
constrained by the solidus of anhydrous, carbonated eclogite under upper 
mantle conditions. Earth Planet. Sci. Lett 227, 73-85 (2004). 
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Crystallographic snapshot of cellulose 
synthesis and membrane translocation 


Jacob L. W. Morgan!, Joanna Strumillo"* & Jochen Zimmer! 


Cellulose, the most abundant biological macromolecule, is an extracellular, linear polymer of glucose molecules. It 
represents an essential component of plant cell walls but is also found in algae and bacteria. In bacteria, cellulose 
production frequently correlates with the formation of biofilms, a sessile, multicellular growth form. Cellulose 
synthesis and transport across the inner bacterial membrane is mediated by a complex of the membrane-integrated 
catalytic BcsA subunit and the membrane-anchored, periplasmic BcsB protein. Here we present the crystal structure of 
a complex of BcsA and BcsB from Rhodobacter sphaeroides containing a translocating polysaccharide. The structure of 
the BcsA-BcsB translocation intermediate reveals the architecture of the cellulose synthase, demonstrates how BcsA 
forms a cellulose-conducting channel, and suggests a model for the coupling of cellulose synthesis and translocation in 
which the nascent polysaccharide is extended by one glucose molecule at a time. 


Polysaccharides perform multiple vital functions in life, as reflected 
in their great variety in chemical composition and physicochemical 
properties. Extracellular polysaccharides, such as cellulose, chitin and 
hyaluronan, primarily perform structural functions and are synthe- 
sized inside the cell from nucleotide-activated donor sugars’. 

Cellulose is the most abundant biological polymer and consists of 
glucose molecules that are connected between their Cl and C4 car- 
bons via acetal linkages*. It is predominantly generated by vascular 
plants and a large number of algae, but also by some bacteria®®, pro- 
tists’ and tunicates*. Cellulose synthases (CESAs) are membrane- 
embedded glycosyltransferases, which use UDP-activated glucose 
(UDP-Glc) to elongate the nascent polysaccharide processively in a 
reaction that inverts the configuration at the anomeric carbon of the 
newly added sugar from to B**!°. Prokaryotic and eukaryotic CESAs 
share a similar predicted topology including eight transmembrane 
helices and at least one extended intracellular glycosyltransferase 
(GT) loop (Supplementary Fig. 1). 

Bacteria, primarily Gram-negatives, produce and secrete cellulose via 
a protein complex consisting of at least three subunits (BcsA, BcsB and 
BcsC)". The inner membrane protein BcsA is the catalytically active 
subunit and contains a conserved family-2 GT domain between trans- 
membrane helices 4 and 5 ([M4 and TM5)". BcsB is a periplasmic 
protein that is anchored to the inner membrane via a single, carboxy- 
terminal transmembrane helix. BcsA and BcsB are fused as a single 
polypeptide in some species, supporting the genetic observation that 
BcsB is essential for cellulose synthesis'*"*. BcsC, required for cellulose 
synthesis in vivo but not in vitro™, is predicted to form an 18-stranded 
B-barrel in the outer membrane. Some bacteria contain additional, non- 
essential, periplasmic subunits of unknown function’. 

Although some bacteria constitutively produce cellulose’, bacterial 
cellulose production is often concomitant with the formation of 
sessile, multicellular aggregates, termed biofilms’’’®. Biofilm bacteria 
are encased in a network of polysaccharides and proteinaceous 
fibres and are of particular concern to human health due to an 
increased tolerance to antimicrobial treatments’’. Biofilm formation 
is stimulated by the bacterial secondary messenger cyclic-di-GMP, 
which bacteria sense via PilZ domains'*. Accordingly, BcsA contains 


a PilZ domain within its C-terminal, intracellular domain and its 
activity is strongly stimulated by cyclic-di-GMP”””. 

In all organisms, the cellulose polymer has to be exported across 
the plasma membrane. However, the mechanism by which a polymer 
consisting of thousands of glucose units can be translocated has been 
unclear. To understand this process, we determined the crystal struc- 
ture of the catalytically active BcsA—BcsB complex in an intermediate 
state during cellulose synthesis and translocation at 3.25 A resolution 
(Supplementary Table 1). 


Architecture of the BcsA-BcsB complex 


BcsA and BcsB form a 1:1 stoichiometric complex spanning appro- 
ximately 150 A perpendicular and 55 A parallel to the membrane. The 
complex is divided into a cuboid-shaped membrane-spanning region 
sandwiched between large cytoplasmic and periplasmic domains 
(Fig. 1a). The BcsA-BesB interface is approximately 4,500 A? inclu- 
ding a cluster of negative and positive charges on BcsA and BcsB, 
respectively (Supplementary Fig. 2). 

BcsA contains four amino-terminal and four carboxy-terminal 
transmembrane helices separated by a large intracellular loop (4/5 
loop) that forms a GT domain (amino acids 128-368) (Fig. la and 
Supplementary Fig. 3). TM3-8 form a narrow channel for the trans- 
locating polysaccharide, and BcsA’s intracellular C terminus (amino 
acids 575-759) contains a six-stranded f-barrel and a highly curved 
a-helical region that attaches the B-barrel to the GT domain (Fig. 1a). 

BcsB is a dome-shaped, f-strand-rich, periplasmic protein. Its 
N-terminal region forms the tip of the dome, whereas the C-terminal 
transmembrane anchor interacts with BcsA (Fig. 1a). Two amphipathic 
helices further stabilize its interaction with BcsA and the periplasmic 
water-membrane interface (Fig. 1a). 

Throughout model building, we noticed a continuous, strong posi- 
tive difference Fourier electron density that could not be accounted 
for by BcsA or BcsB (Fig. 1b). The ribbon-shaped electron density 
extended from BcsA’s GT domain in a 45° angle towards the mem- 
brane, kinked near the putative water-lipid interface to run through 
the centre of the transmembrane region, and bent sharply at the 
BcsA-BcsB interface to emerge from the complex at approximately 
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Figure 1 | Architecture of the BcsA-BcsB complex. a, BcsA and BcsB form 
an elongated complex with large cytosolic and periplasmic domains. The 
transmembrane helices of BcsA are coloured green, the glycosyltransferase 
(GT) domain beige, and the C-terminal domain red. BesB is shown in light and 
dark blue for its periplasmic and membrane-associated regions, respectively. 
The N and C termini of both subunits are indicated and the translocating 
glucan and UDP are shown as cyan and violet spheres. Horizontal bars indicate 


145° (Fig. 1b). Past the BcsA-BcsB interface, the electron density 
extended into the bulk solvent. The characteristic shape of the electron 
density, its proximity to conserved residues throughout its path (see 
below), its origin at the glycosyltransferase catalytic site, and extension 
past the BcsA—BcsB complex strongly suggest that it represents a trans- 
locating glucan probably synthesized during or before crystallization. 
The catalytic activity of the purified, detergent-solubilized BcsA—BcsB 
complex was confirmed by incubating it with UDP-Glc and cyclic- 
di-GMP. The synthesized polymer, that could be completely degraded 
by B-1,4-glucanase (data not shown), was sedimented, purified and 
analysed by glycosyl linkage analysis (performed by the Complex 
Carbohydrate Research Center, University of Georgia), confirming that 
94% of it consists of 4-linked glucose (Supplementary Fig. 4). 
Although probably reflecting an average of different registers of the 
translocating glucan, the density discerns the overall position of 18 
glucose molecules (Fig. 1b and Supplementary Fig. 5). According to 
the prevailing model that cellulose synthase elongates the non-reducing 
end of the growing polymer’”’, the glucan was modelled with its non- 
reducing end (glucose-18) at the glycosyltransferase catalytic site. At the 
active site, we observed additional, weak electron density detached from 
the glucan. On the basis of its shape and coordination by residues that 
are predicted to be implicated in nucleotide binding and glycosyl 
transfer’, it seemingly represents a weakly bound UDP molecule 
(Fig. 1b and Supplementary Fig. 5). Therefore, our structure probably 
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the membrane boundaries. IF, amphipathic interface helices of BcsA. 

b, Unbiased F,, — F, difference Fourier electron density (pink mesh, contoured 
at 4.50) calculated before modelling the glucan and UDP molecules. The 
continuous density runs from the intracellular catalytic site to the periplasmic 
BcsA-BcsB interface and accommodates 18 glucose molecules (cyan sticks). 
The UDP density was contoured at 3o (right panel). 


represents a cellulose synthase-cellulose translocation intermediate 
captured after glycosyl transfer and before replacing UDP with UDP- 
Gle. 


The glycosyltransferase domain 
The GT domain of BcsA contains the conserved signature D,D,D,Q 
(Q/R)XRW of three variably spaced aspartic acids and a pentapeptide 
consisting of a Gln that is often followed by a Gln or Arg, a variable 
residue, and an Arg and Trp. This signature is shared with other proces- 
sive B-glycosyltransferases, such as chitin and hyaluronan synthases”. 
The GT domain adopts a glycosyltransferase A fold (GT-A)'° con- 
sisting of a mixed, seven-stranded [-sheet that is surrounded by seven 
a-helices and attaches to the transmembrane region via three amphi- 
pathic interface (IF) helices (IF1-3) (Figs 1-3 and Supplementary 
Fig. 3). IFl is formed by residues belonging to the 4/5 loop (amino 
acids 298-312) and packs against the C termini of BcsA’s TM8 and 
TM4 as well as the N terminus of TMS5. The second helix, IF2 (amino 
acids 373-395), directly precedes TM5 and includes the invariant 
Q(Q/R)XRW sequence of the signature. IF3 (amino acids 470-498) 
follows TM6 and forms a crescent-shaped helix around the cytosolic 
entrance to the transmembrane channel (discussed below). It is con- 
nected to TM7 via a conserved loop (IF3-TM7 loop) that runs across 
the entrance to the GT substrate-binding site (Fig. 2b and Sup- 
plementary Fig. 3). The GT domain protrudes from these IF helices 
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at approximately 45° (Fig. 1a). The inverting glycosyltransferase SpsA 
from Bacillus subtilis’? superimposes with BcsA’s GT domain with 
2.15A root mean square deviation (r.m.s.d.) between Ca atoms 
(Supplementary Fig. 6). 

The structure of the cellulose synthase reveals the function of the sig- 
nature D,D,D,Q(Q/R)XRW as well as other highly conserved sequence 
motifs. Consistent with non-processive glycosyltransferases****, the 
first two aspartic acids of the signature, Asp 179 and Asp 246, coordinate 
UDP (Fig. 2a). The third ‘D’ (Asp 343), which is part of the invariant 
“TED” motif, probably represents the catalytic base owing to its proximity 
to the non-reducing end of the glucan (Fig. 2a). 

The Q(Q/R)XRW part of the signature, together with an equally 
conserved FFCGS sequence, forms a binding site for the terminal 
disaccharide of the glucan, the acceptor (Fig. 2a). The Q(Q/R)XRW 
sequence belonging to IF2 is a part of the cytoplasmic entry to the 
glucan channel (Fig. 3). The first Q, Gln 379, forms a hydrogen bond 
to the guanidino group of the neighbouring Arg 380, which in turn 
forms a salt bridge to the invariant Glu 297. Arg 382, preceding the 
conserved Trp residue, most likely coordinates the diphosphate of 
UDP; however, the electron density for its side chain is weak, consis- 
tent with the apparent low occupancy of UDP. Trp 383 forms van der 
Waals interactions with the penultimate glucose molecule at the 
acceptor site (glucose-17), and packs against the Gln 379-Arg 380 pair 
towards the distal site of the catalytic pocket (Fig. 2a). The FFCGS 
pentapeptide (amino acids 316-320) interacts with the opposite side 
of the terminal disaccharide, primarily via the backbone carbonyls of 
Cys 318 and Gly 319 with glucose-18. 

The dominant-negative thanatos mutation (Pro578Ser) in Arabi- 
dopsis thaliana CESA3” maps to the invariant QTPH sequence in bac- 
terial cellulose synthases (Supplementary Fig. 1). His 276 of this motif is 
within hydrogen-bonding distance to glucose-17 (Fig. 2a) and is thus 
probably also involved in positioning the non-reducing end with respect 
to the catalytic base. The motif is part of an extended f-strand and 
Pro 275 (equivalent to Pro 578 in A. thaliana CESA3) induces a ‘kink’ 
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in the B-strand that positions the side chain of His 276 towards the glucan, 
which might be disrupted when Pro is replaced by another amino acid. 

A cavity next to the UDP-binding pocket might accommodate the 
glucose donor in the UDP-Glc-bound state. This pocket is flanked on 
one side by Ala 225 and Lys 226 of the invariant AKAGN motif and, 
on the other side, by Glu342 (of the conserved TED motif) and 
His 224 (Fig. 2a and Supplementary Fig. 7). 


Activation by cyclic-di-GMP 

The activity of BcsA is stimulated by cyclic-di-GMP”*. The cyclic- 
di-GMP-responsive PilZ domain is localized within BcsA’s C ter- 
minus’’, right next to the GT domain (Fig. 2b). The C terminus of 
BcsA extends from the last transmembrane helix (TMS8) via a short 
B-strand, which forms a two-stranded B-sheet with the IF3-TM7 
loop, before it folds into a six-stranded B-barrel (amino acids 585- 
675). The B-barrel points away from the GT domain at approximately 
90°. Past the B-barrel, the polypeptide chain continues in an «-helical 
conformation on the surface of the GT domain towards the water- 
lipid interface. At the interface, it forms an amphipathic helix that 
interacts with TM6, IF3 and TM8 (Fig. 1a) before it curves away from 
the membrane towards the GT domain and breaks at Glu 743. After 
Glu 743, the polypeptide continues in an extended conformation 
towards the B-barrel, past the substrate-binding pocket. 

The B-barrel of BcsA aligns with an r.m.s.d. of 2.2 A between Ca 
atoms with the cyclic-di-GMP binding protein VCA0042** (Sup- 
plementary Fig. 8). The conserved residues Arg584, Asp 609, 
Ser 611 and Gly 614 on the B-barrel surface as well as Gln 578 and 
Arg 579 of the TM8-[-barrel linker are probably implicated in cyclic- 
di-GMP binding” (Fig. 2b and Supplementary Fig. 8). We hypothe- 
size that cyclic-di-GMP binding to BcsA induces conformational 
changes that allow UDP-Glc to access the catalytic site. A likely can- 
didate for displacement is the IF3-TM7 loop that runs across the 
entrance to the catalytic pocket and interacts with the TM8-f-barrel 
linker (Fig. 2b and Supplementary Fig. 3). Notably, replacing Thr 


Figure 2 | Organization of BcsA’s catalytic site 
and PilZ domain. a, Conserved residues of BcsA 
coordinate UDP and the terminal disaccharide of 
the glucan. Side chains represented in sticks belong 
to the sequence motifs shown in single letter code. 
Conserved residues are highlighted in bold and the 
depicted residues are underlined. All side chains 
shown could be positioned unambiguously in the 
electron density map. No density was observed for 
the side chains of Lys 226 and Arg 382 (shown as 
spheres for their Cx atoms). The probable position 
of the donor Glc is indicated by a dashed ellipsoid. 
b, Surface representation of BcsA with its 
C-terminal domain shown as cartoon in red. The 
linker connecting TM8 with the B-barrel forms a 
two-stranded f-sheet (shown in cyan) with the 
IF3-TM7 loop. TM7 is coloured dark grey and IF3 
is orange. The Co atoms of residues likely involved 
in cyclic-di-GMP binding are shown as spheres and 
are labelled. The horizontal bar indicates the 
cytoplasmic membrane boundary. 
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942 with Ile in A. thaliana CESA3 confers resistance to the herbi- 


cide isoxaben”. The corresponding, equally conserved residue in our 
structure is Thr 506 in the IF3-TM7 loop (Supplementary Fig. 1), 
supporting the hypothesis that this loop is involved in regulating 
the activity of BcsA. 


The transmembrane region and polysaccharide channel 
The transmembrane region of the cellulose synthase includes eight 
transmembrane helices from BcsA (TM1-8) and one from BcsB 
(Fig. 3a). The helices are framed by three cytoplasmic and two peri- 
plasmic IF-helices formed by BcsA and BcsB, respectively. In addition, 
the N-terminal four turns of TM5 also run parallel to the membrane 
before the helix bends towards the hydrocarbon core at Pro 420 
(Fig. 3a). All transmembrane helices, except TM6 and TM7 and 
BcsB’s transmembrane helix, tilt with respect to the membrane normal 
and surround a narrow channel approximately 8 A wide and 33 A long. 
Because the transmembrane helices are organized in pairs in one direc- 
tion (helices 1/2, 3/6, 4/8 and 5/7 of BcsA), their overall arrangement 
resembles an elongated cuboid, and BcsB’s transmembrane anchor 
packs against one side of the cuboid (Fig. 3a). On the membrane 
periplasmic side, BcsB forms two amphipathic helices (IF4 and IF5) 
at the water-lipid interface. IF4 (amino acids 569 to 592) interacts with 
the periplasmic termini of BcsA’s TM2, TM6 and TM8, and IF5 
(amino acids 679-693) directly precedes BcsB’s membrane anchor 
(amino acids 695-720) and contacts BcsA’s TM1 and the 1/2 loop 
(Fig. 3a). 

The channel formed by BcsA’s TM3-8 accommodates ten glucose 
units of the translocating glucan (Fig. 3). The glucan enters the channel 
through the cytoplasmic opening formed by IF1-3 and the N-terminal 
half of TM5, crosses the membrane parallel to TM5 and TM6, and exits 
on the periplasmic side between BcsA’s 5/6 and 7/8 loops at the BcsA- 
BcsB interface. Here, the glucan kinks to protrude from the complex 
sideways (Fig. 3b). TM6 and TMB are separated by about 9 Aandi5A 
(between Cx atoms) at the periplasmic and cytoplasmic sides of the 
membrane, respectively, and only the tips of their side chains maintain 
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Figure 3 | The membrane- 
integrated domain of BcsA-BcsB. 
a, The transmembrane region 
includes nine transmembrane helices 
and three cytoplasmic and two 
periplasmic interface helices (IF1-5). 
BcsA and BcsB are shown as green 
and blue ribbons, respectively. 
Transmembrane and 

interface helices are show in cartoon 
representation and coloured in 
shades of green and orange for 
BcsA’s TM1-8 and IF1-3, 
respectively, and shades of blue for 
BcsB’s IF4 and IF5 and its C-terminal 
transmembrane anchor. TM3-8 of 
BcsA form a narrow channel. Its 
cytoplasmic entrance is formed by 
IF1-3 as well as the N-terminal half 
of TM5 and its periplasmic exit is 
between the periplasmic 5/6 and 7/8 
loops of BcsA. Trp 383 of the Q(Q/ 
R)XRW motif is shown in stick 
representation. b, Conserved 
residues of BcsA interact with the 
translocating glucan. BcsA and BcsB 
are shown as ribbons, and side chains 
of BcsA contacting the glucan (cyan 
surface) are shown as sticks. Residues 
of BcsB contacting the polymer are 
not conserved and are not shown. 


the barrier towards the lipids (Fig. 3a). In the absence ofa translocating 
glucan, transmembrane helices 7 and 8 might move towards the 
N-terminal transmembrane helices to close the channel. 

All transmembrane helices of BcsA, with the exception of TM1 and 
TM2, directly contact the glucan. The faces of the glucopyranose rings 
form van der Waals interactions with residues Met 300 and Phe 301 
of IF1, Phe316 directly proceeding IF1, Trp 383 of IF2, Phe 419, 
Phe 426 and Tyr 433 of TM5, Phe 441 in the 5/6 loop, and Val 551, 
Val 555 and Trp 558 of TM8 (Fig. 3b). The equatorial hydroxyl groups 
of the glucose units form hydrophilic contacts with Tyr 80 of TM3, 
Asn 118 of TM4, His 276 of the GT domain, Asn 412 and Arg 423 of 
TM5, Glu 439 in the TM5/6 loop, Tyr 455 of TM6, and Ser 476 and 
Glu 477 of IF3. We note that the quinoxyphen-resistance mutation 
Ala903Val in A. thaliana CESA1* aligns with Tyr 455 in TM6, which 
forms a hydrogen bond to the translocating glucan (Fig. 3b and Sup- 
plementary Fig. 1). Therefore, although the sequence alignment in 
this region is weak, it is likely that quinoxyphen affects the transloca- 
tion rather than the synthesis of the glucan. 


The periplasmic domain 


BcsB protrudes approximately 60A into the periplasm with its 
N-terminal half (amino acids 54-307) forming the membrane distal 
part. BcsB consists primarily of B-strands that are organized into dis- 
tinct domains, including two jellyroll and two flavodoxin-like 
folds (Fig. 4). The jellyrolls (amino acids 54-185 and 316-452) show 
a striking structural similarity with carbohydrate binding modules 
(Supplementary Fig. 9)*' and are termed carbohydrate-binding domains 
(CBD) 1 and 2. Both CBDs contain eight antiparallel B-strands separated 
by loops 2-16 residues in length and adopt a similar fold, such that 
their Co atoms superimpose with an r.m.s.d. of 2.5 A (Supplementary 
Fig. 9). The CBDs are positioned at a 45° angle relative to one another 
and interact via two extended loops, including a disulphide bond 
between the conserved Cys 163 and Cys 430 (Fig. 4a). 

CBD2 runs parallel to the plane of the membrane above the peri- 
plasmic exit of the transmembrane channel. It interacts with BcsA’s 
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Cytoplasm 


TM2 and TM3, as well as with its 5/6 loop, thereby accounting for 
almost 30% of the complex interface. Although BcsB exhibits signifi- 
cant sequence variability, we noticed a cluster of conserved residues 
including His 159, Arg 160, Ile 161, Leu 171 and Trp 172 at the tip of 
CBD1, close to its disulphide bond with CBD2 and above the 
periplasmic exit of the glucan channel (Fig. 4b). A comparison of 
CBD1 with the bacterial carbohydrate binding module family 35 
(CBM35) in complex with a glucuronic acid disaccharide” localizes 
disaccharide binding to the cluster of conserved residues in CBD1 
(Supplementary Fig. 9), indicating that this region interacts with the 
translocating glucan. 

Both CBDs are linked to compact «/$ domains containing a central 
B-sheet five or six strands wide that is framed on either side by two 
a-helices (Fig. 4a). Although probably lacking functional similarity, 
both domains resemble a flavodoxin fold** (Supplementary Fig. 10) 
and are thus referred to as FD1 (amino acids 192-303) and FD2 
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Figure 5 | Proposed model for cellulose synthesis and translocation. After 
glycosyl transfer, the newly added Glc could rotate around the acetal linkage into 
the plane of the polymer. The rotation direction would be determined by steric 
interactions and formation of the B-1,4 glucan characteristic intramolecular 03- 
HeeO5 hydrogen bond. The glucan might translocate into the channel during 
this relaxation. This process would be repeated with a second UDP-Glc but the 
rotation direction after glycosyl transfer would be in the opposite direction 
owing to steric constraints. Alternatively, the glucan might not translocate into 
the channel until UDP is replaced by UDP-Glc. Trp 383 and Cys 318 mark the 
entrance to the transmembrane channel (only shown in the right panel). 
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Figure 4 | Organization of the periplasmic 
domain. a, The periplasmic region of the cellulose 
synthase is primarily formed by BcsB. It consists of 
two periplasmic carbohydrate binding domains 
(CBD1 and CBD2) that are connected to two a/f 
domains (FD1 and FD2). The CBDs are covalently 
attached via a disulphide bond between the 
conserved Cys 163 and Cys 430 (yellow spheres). 
BcsB’s transmembrane anchor and interface 
helices 4 and 5 are coloured in dark blue. The N and 
C termini of BcsB are indicated. b, Surface 
representation of BcsB coloured according to 
sequence conservation from red (variable) to deep 
blue (invariant). The tips of CBD1 and CBD2 form 
a patch of conserved, primarily hydrophobic 
residues above the periplasmic exit of the 
transmembrane channel. BcsA is shown as a grey 
surface and the glucan as purple spheres. 


(amino acids 457-529 together with 598-666). The repeating struc- 
tural motif of BcsB therefore contains a CBD linked to an FD domain. 


Implications for cellulose synthesis and translocation 
The structure of the cellulose synthase-cellulose translocation inter- 
mediate provides unique insights into the mechanism of cellulose 
synthesis and translocation. On the basis of the large distance between 
the glycosyltransferase site and the putative water-membrane inter- 
face (approximately 25 A; Fig. 1) as well as BcsA’s activity in detergent, 
a reaction mechanism involving lipid-linked intermediates seems 
unlikely’. The spatial restrictions in the substrate-binding pocket fur- 
ther suggest that the nascent glucan is extended by one** and not by 
two™ glucose molecules at a time. 

The distinct features of the glucan electron density (Fig. 1b) further 
indicate that the polymer translocates one glucose molecule at a time. 
The dimensions of the transmembrane channel require that the glu- 
copyranose rings have to be within the same plane to slide into the 
pore (Fig. 3b). Thus, we speculate that after glycosyl transfer the newly 
attached terminal glucose molecule will rotate around the acetal link- 
age**”* to align with the glucan in the channel (Fig. 5). Steric interac- 
tions might dictate the rotation direction, leading to the B-1,4 glucan 
characteristic 180° rotation between individual glucose units and 
intramolecular hydrogen bonding between ring oxygens and 3’ 
hydroxyl groups of neighbouring units*’’. This relaxation might be 
sufficient to allow the polymer to slide into the channel (Fig. 5). 
Alternatively, the elongated glucan might not translocate until UDP 
is replaced with UDP-Glc, necessitating translocation owing to spatial 
restrictions at the active site. Past the channel, the induced kink of the 
glucan chain at the BcsA-BcsB interface, the interaction with BcsB’s 
CBDs, or the aggregation with other glucans might further contribute 
to a unidirectional movement of the polymer. 

Cellulose, chitin and hyaluronan are probably synthesized by a con- 
served mechanism involving a membrane-embedded glycosyltransferase 
that couples polymer synthesis with its translocation across the cell mem- 
brane’. This unique mechanism stands in contrast to the translocation of 
most other biological polymers where ‘polymer synthases’ function inde- 
pendently of or alongside dedicated translocation machineries*“’. Our 
structure now provides a basis for unravelling the details of this process. 


METHODS SUMMARY 


The BcsA and BcsB subunits of the Rhodobacter sphaeroides cellulose synthase 
complex were co-expressed and purified by metal affinity and gel filtration chromato- 
graphy as described in detail in the Methods. The BcsA-BcsB complex was crystal- 
lized and the structure was determined after single anomalous dispersion phasing of a 
data set obtained from samarium 11-chloride-soaked crystal. Additional experimental 
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phases were obtained from selenomethionine-derivatized BcsA-BcsB crystals. The 
quality of the electron density differed between peripheral and transmembrane 
regions of the complex. Therefore, modelling of the N-terminal region of BcsB (amino 
acids 54-177) and amino acids 586-675 of BcsA relied on the position of bulky side 
chains, Se-Met positions and backbone electron density. The model was built in 
Coot** and includes all residues with the exception of the terminal residues 1-12 
and 760-788 of BcsA and 19-53 and 721-725 of BesB. In addition, a short loop of 
BcsB consisting of residues 532-543 was not visible in the electron density map. 


Full Methods and any associated references are available in the online version of 
the paper. 


Received 12 July; accepted 31 October 2012. 
Published online 9 December 2012. 


1. Somerville, C. Cellulose synthesis in higher plants. Annu. Rev. Cell Dev. Biol. 22, 
53-78 (2006). 

2. Merzendorfer, H. Insect chitin synthases: a review. J. Comp. Physiol. B 176, 1-15 (2006). 

3. Hubbard, C., McNamara, J. T., Azumaya, C., Patel, M. S. & Zimmer, J. The 
hyaluronan synthase catalyzes the synthesis and membrane translocation of 
hyaluronan. J. Mol. Biol. 418, 21-31 (2012). 

4. Nishiyama, Y., Langan, P. & Chanzy, H. Crystal structure and hydrogen-bonding 
system in cellulose IB from synchrotron X-ray and neutron fiber diffraction. J. Am. 
Chem. Soc. 124, 9074-9082 (2002). 

5. Matthysse, A. G., Thomas, D. L. & White, A. R. Mechanism of cellulose synthesis in 
Agrobacterium tumefaciens. J. Bacteriol. 177, 1076-1081 (1995). 

6. Bureau, T. E. & Brown, R. M. /n vitro synthesis of cellulose II from a cytoplasmic 
membrane fraction of Acetobacter xylinum. Proc. Natl Acad. Sci. USA 84, 
6985-6989 (1987). 

7. Grimson, M. J., Haigler, C. H. & Blanton, R. L. Cellulose microfibrils, cell motility, and 
plasma membrane protein organization change in parallel during culmination in 
Dictyostelium discoideum. J. Cell Sci. 109, 3079-3087 (1996). 

8. Kimura, S. & Itoh, T. New cellulose synthesizing complexes (terminal complexes) 
involved in animal cellulose biosynthesis in the tunicate Metandrocarpa uedai. 
Protoplasma 194, 151-163 (1996). 

9. Aloni, Y., Delmer, D. P. & Benziman, M. Achievement of high rates of in vitro 
synthesis of 1,4-beta-p-glucan: activation by cooperative interaction of the 
Acetobacter xylinum enzyme system with GTP, polyethylene glycol, and a protein 
factor. Proc. Natl Acad. Sci. USA 79, 6448-6452 (1982). 

10. Lairson, L. L., Henrissat, B., Davies, G. J. & Withers, S. G. Glycosyltransferases: 
structures, functions, and mechanisms. Annu. Rev. Biochem. 77, 521-555 (2008). 

11. Rémling, U. Molecular biology of cellulose production in bacteria. Res. Microbiol. 
153, 205-212 (2002). 

12. Cantarel, B. L. etal. The Carbohydrate-Active EnZymes database (CAZy): an expert 
resource for glycogenomics. Nucleic Acids Res. 37, D233-D238 (2009). 

13. Standal, R. eta/. Anew gene required for cellulose production and a gene encoding 
cellulolytic activity in Acetobacter xylinum are colocalized with the bcs operon. 

J. Bacteriol. 176, 665-672 (1994). 

14. Saxena, |. M., Kudlicka, K., Okuda, K. & Brown, R. M. Characterization of genes in the 
cellulose-synthesizing operon (acs operon) of Acetobacter xylinum: implications 
for cellulose crystallization. J. Bacteriol. 176, 5735-5752 (1994). 

15. Hu,S.Q. etal. Structure of bacterial cellulose synthase subunit D octamer with four 
inner passageways. Proc. Nat! Acad. Sci. USA 107, 17957-17961 (2010). 

16. Jahn, C.E,, Selimi, D.A., Barak, J.D. & Charkowski, A. O. The Dickeya dadantii biofilm 
matrix consists of cellulose nanofibres, and is an emergent property dependent 
upon the type Ill secretion system and the cellulose synthesis operon. Microbiology 
157, 2733-2744 (2011). 

17. Stewart, P. S. & Costerton, J. W. Antibiotic resistance of bacteria in biofilms. Lancet 
358, 135-138 (2001). 

18. Amikam, D. & Galperin, M. Y. PilZ domain is part of the bacterial c-di-GMP binding 
protein. Bioinformatics 22, 3-6 (2006). 

19. Rémling, U., Gomelsky, M. & Galperin, M. Y. C-di-GMP: the dawning of a novel 
bacterial signalling system. Mol. Microbiol. 57, 629-639 (2005). 

20. Ross, P. et al. Regulation of cellulose synthesis in Acetobacter xylinum by cyclic 
diguanylic acid. Nature 325, 279-281 (1987). 

21. Koyama, M., Helbert, W., Imai, T., Sugiyama, J. & Henrissat, B. Parallel-up structure 
evidences the molecular directionality during biosynthesis of bacterial cellulose. 
Proc. Natl Acad. Sci. USA 94, 9091-9095 (1997). 

22. Lai-Kee-Him, J. et al. In vitro versus in vivo cellulose microfibrils from plant primary 
wall synthases: structural differences. J. Biol. Chem. 277, 36931-36939 (2002). 

23. Charnock, S. J. & Davies, G. J. Structure of the nucleotide-diphospho-sugar 
transferase, SpsA from Bacillus subtilis, in native and nucleotide-complexed forms. 
Biochemistry 38, 6380-6385 (1999). 


186 | NATURE | VOL 493 | 10 JANUARY 2013 


24. Weigel, P. H. & Deangelis, P. L. Hyaluronan synthases: a decade-plus of novel 


25: 


26. 


27. 


glycosyltransferases. J. Biol. Chem. 282, 36777-36781 (2007). 

Kozmon, S. & Tvaroska, |. Catalytic mechanism of glycosyltransferases: hybrid 
quantum mechanical/molecular mechanical study of the inverting 
-acetylglucosaminyltransferase |. J. Am. Chem. Soc. 128, 16921-16927 (2006). 
Kubota, T. et al. Structural basis of carbohydrate transfer activity by human UDP- 
GalNAc: polypeptide o-N-acetylgalactosaminyltransferase (pp-GalNAc-T10). 

J. Mol. Biol. 359, 708-727 (2006). 

Daras, G. et al. The thanatos mutation in Arabidopsis thaliana cellulose synthase 3 
(AtCesA3) has a dominant-negative effect on cellulose synthesis and plant growth. 
New Phytol. 184, 114-126 (2009). 


28. Benach, J. et al. The structural basis of cyclic diguanylate signal transduction by 


PilZ domains. EMBO J. 26, 5153-5166 (2007). 


29. Scheible, W.R., Eshed, R., Richmond, T., Delmer, D. & Somerville, C. Modifications of 


cellulose synthase confer resistance to isoxaben and thiazolidinone herbicides in 
Arabidopsis |xr1 mutants. Proc. Nat! Acad. Sci. USA 98, 10079-10084 (2001). 


30. Harris, D. M. et al. Cellulose microfibril crystallinity is reduced by mutating 


31. 


C-terminal transmembrane region residues CESA1A903V and CESA3T9421 of 
cellulose synthase. Proc. Natl Acad. Sci. USA 109, 4098-4103 (2012). 

ontanier, C. etal. Evidence that family 35 carbohydrate binding modules display 
conserved specificity but divergent function. Proc. Natl Acad. Sci. USA 106, 
3065-3070 (2009). 


32. Sancho, J. Flavodoxins: sequence, folding, binding, function and beyond. Cell. Mol. 


33. 


Life Sci. 63, 855-864 (2006). 
Delmer, D. P. Cellulose biosynthesis: Exciting times for a difficult field of study. 
Annu. Rev. Plant Physiol. Plant Mol. Biol. 50, 245-276 (1999). 


34. Carpita, N.C. Update on mechanisms of plant cell wall biosynthesis: How plants 


35. 


make cellulose and other (14)-B-p-glycans. Plant Physiol. 155, 171-184 (2011). 
Dowd, M. K., French, A. D. & Reilly, P. J. Conformational analysis of the anomeric 
forms of sophorose, laminarabiose, and cellobiose using MM3. Carbohydr. Res. 
233, 15-34 (1992). 


36. Momany, F. A. & Schnupf, U. DFTMD studies of B-cellobiose: conformational 


37. 


preference using implicit solvent. Carbohydr. Res. 346, 619-630 (2011). 
Brown, M. Cellulose structure and biosynthesis: What is in store for the 21st 
century? J. Pol. Sci. 42, 487-495 (2004). 


38. Rapoport, T. A. Protein transport across the endoplasmic reticulum membrane. 


FEBS J. 275, 4471-4478 (2008). 


39. Burton, B. & Dubnau, D. Membrane-associated DNA transport machines. Cold 


Spring Harb. Perspect. Biol. 2, 1-20 (2010). 


40. Whitfield, C. Biosynthesis and assembly of capsular polysaccharides in Escherichia 


41. 


42. 


Su 
Ac 


coli. Annu. Rev. Biochem. 75, 39-68 (2006). 

Raetz, C. R. H. & Whitfield, C. Lipopolysaccharide endotoxins. Annu. Rev. Biochem. 
71, 635-700 (2002). 
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta 
Crystallogr. D 60, 2126-2132 (2004). 


pplementary Information is available in the online version of the paper. 


knowledgements We are grateful to G. Murshudov for advice on Refmac refinement 


and U. and Z. Derewenda for discussions. We thank L. Tamm, M. Wiener, A. Walling and 
T. Rapoport for critical comments on the manuscript. X-ray diffraction data were 


co! 


lected at GM/CA- and Southeast Regional-Collaborative Access Team beamlines at 


the Advanced Photon Source (APS), Argonne National Laboratory. Use of the APS was 


su 
Sc 


pported by the US Department of Energy, Office of Science, Office of Basic Energy 
iences, contract no. DE-ACO2-06CH11357 and W-31-109-Eng-38. GM/CA at APS 


has been funded in whole or in part with funds from the National Cancer Institute 


1 


Th 
Ce 


-CO-1020) and the National Institute of General Medical Sciences (Y1-GM-1104). 
e University of Georgia CCRC is supported by the Department of Energy funded 
nter for Plant and Microbial Complex Carbohydrates (DE-FGO2-09ER-20097). 


J.L.W.M. is partially supported by a Peach Fellowship, University of Virginia. J.Z. is 


su 


Virginia School of Medicine. 


pport by NIH grant 1RO1GM101001 and start-up funds from the University of 


Author Contributions J.Z. designed the experiments. J.L.W.M. and J.Z. expressed, 


pu 


rified and crystallized the BcsA-BcsB complex. J.L.W.M. and J.Z. analysed the 


crystallographic data and built the model. J.S. performed in vitro cellulose synthesis 


assays. J.L.W.M. and J.Z. wrote the manuscript. 


Author Information The atomic coordinates and structure factors have been deposited 
in the Protein Data Bank under accession number 4HG6. Reprints and permissions 
information is available at www.nature.com/reprints. The authors declare no 
competing financial interests. Readers are welcome to comment on the online version 
of the paper. Correspondence and requests for materials should be addressed to J.Z. 


(jo 


chen_zimmer@virginia.edu). 


©2013 Macmillan Publishers Limited. All rights reserved 


METHODS 
Expression and purification of BcsA-BcsB. The bcsA gene was PCR amplified 
from Rhodobacter sphaeroides 2.4.1. genomic DNA (ATCC) and cloned using 
Ncol and HindIII restriction sites into an engineered pETDuet expression vector 
(Novagen) conferring a C-terminal dodeca-histidine tag. The bcsB gene was PCR 
amplified without its native, N-terminal signal sequence. The gene coding for 
amino acids 21-725 of BcsB was cloned into the pETDuet vector containing the 
bcsA gene, together with an N-terminal PelB signal sequence using NdeI and 
KpnI restriction sites. After signal sequence cleavage, this generates an additional 
Met-Gly dipeptide at the N terminus of BcsB. Electro-competent Rosetta 2 cells 
were transformed with the pETDuetBcsA-12His-BcsB construct and plated on 
agar plates supplemented with 100 1g ml! ampicillin and 25 pg ml“! chloram- 
phenicol and grown overnight at 37 °C. The transformed cells were re-suspended 
and used to inoculate 361 of ZYP-5052 auto-induction medium”. The cells were 
grown shaking at 37 °C for at least 4h after the optical density at 600 nm reached 
0.8. Subsequently, the cells were collected by centrifugation, re-suspended in 
RB1-buffer containing 20mM sodium phosphate buffer pH 7.2, 0.1M NaCl 
and 5 mM cellobiose and then lysed in a microfluidizer. The crude membranes 
were collected by centrifugation for 60 min at 120,000g in a Beckman Ti45 rotor 
and solubilized for 60 min at 4°C in RB2 buffer containing 20mM sodium 
phosphate buffer pH 7.2, 0.3M NaCl, 5mM cellobiose, 5mM MgCl, 40 mM 
imidazole, 10% glycerol and 2% Triton X-100. The insoluble material was cleared 
by centrifugation for 30 min at 120,000g in a Beckman Ti45 rotor and the mem- 
brane extract was batch incubated with 10ml Ni-NTA agarose (Qiagen) for 
45 min at 4 °C. The resin was packed in a gravity flow chromatography column, 
washed with 75 ml WB1-buffer (RB2-buffer containing a total of 60 mM imida- 
zole and 5 mM lauryldimethylamine-N-oxide, LDAO), 25 ml WB2-buffer (RB2 
buffer containing a total of 1M NaCl and 5mM LDAO) and the BcsA-BcsB 
complex was eluted in 50 ml EB-buffer containing 20 mM Tris HCl pH 7.5, 0.1M 
NaCl, 5 mM cellobiose, 5 mM MgCl, 250 mM imidazole, 10% glycerol and 5 mM 
LDAO. The eluted protein was concentrated and purified over an analytical $200 
gel filtration column (GE Healthcare) equilibrated in EB buffer containing 
20mM MES pH 6.5 instead of Tris HCl and lacking MgCl, and imidazole (GF- 
buffer). The eluted BcsA-BcsB was concentrated to 50 1M final concentration 
(using an extinction coefficient at 280 nm of 161,925 M !'cm7!) ina 100-kDa 
cut-off centrifugal filter (Millipore) and supplemented with 10mM EDTA 
(EDTA) and 1 mM uridine diphosphate-glucose (UDP-Glc) before crystallization. 
Selenomethionine-derivatized BcsA-BcsB was prepared as described above 
with the exception that the cells were grown in the auto-inducing PASM-5052 
minimal medium supplemented with 125 jig ml‘ selenomethionine (Se-Met)”. 
The BcsA-BcsB complex was crystallized in 30% PEG200, 0.1M MES pH6.5 
and 50 mM NaClat 4 °C. Initial crystals were observed after approximately 7 days 
and reached their final size of about 50 X 50 X 100 tm within 3 weeks. The crystals 
were collected and directly cryo-cooled in liquid nitrogen. Wild-type crystals were 
soaked with 20 mM samarium m1-chloride (Sm) or 1 mM sodium ethylmercuric- 
thiosalicylate (EMTS) for 2-24h before collection. 
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Structure determination. X-ray diffraction data were collected at the Argonne 
National Laboratory beamlines SER- as well as GM/CA-CAT. The data were 
indexed and integrated in Mosflm™, analysed in Pointless and scaled and reduced 
in space group P432,2 in Scala from the CCP4 suite of programs’. The Sm- 
derivatized crystals diffracted to 5A resolution and were used for single- 
wavelength anomalous dispersion (SAD) phasing in ShelX**. The initial phases 
allowed localization of all transmembrane helices of BcsA and were further used 
to calculate an anomalous difference Fourier electron density map for the data set 
obtained from Se-Met-derivatized BcsA-BcsB. Initially, ten Se-Met positions 
were identified and refined in Mlphare*’. The phases were combined with the 
native data set followed by density modification with phase extension to 3.25 A in 
DM” using a user-generated solvent mask. Throughout the model building pro- 
cess, Se-Met anomalous difference Fourier electron density maps were calculated, 
additional Se-Met positions were refined as described above and new phases were 
calculated and improved by density modification in Parrot’’. The final experi- 
mental phases were calculated from 36 Se-Met-, 4 EMTS- and 8 Sm positions and 
allowed tracing of most of the backbone of BcsA and the membrane proximal 
regions of BcsB. The model was built in Coot’ via an iterative process involving 
careful model building, combination of experimental and model phases, density 
modification and restraint refinement in Refmac-5* (including TLS parameters**® 
for the final model). The model includes a f-1,4-linked glucan containing 18 
glucose molecules and a UDP molecule, as well as 2 partially ordered LDAO 
molecules. The UDP molecule was refined with 50% occupancy owing to weak 
electron density. The refined model has 91.8% of its residues in the favoured 
region of the Ramachandran diagram and includes 1.1% outliers, as determined 
by MolProbity. All figures were prepared in Pymol”’ and the sequence conser- 
vation score for BcsB was calculated using CONSURF”. 

In vitro cellulose synthesis. 100 1l of BcsA-BcsB in GF buffer was incubated with 
5mM UDP-Glc, 20 mM MgCl, and 30 uM cyclic-di-GMP at 37 °C for 90 min. The 
polymerization reaction was terminated with 40 mM EDTA and the sample was cen- 
trifuged at 14,000g at room temperature for 20 min. The supernatant was discarded 
and the pellet was washed twice in 500 1l deionized water, dried and subjected to link- 
age analysis by the Complex Carbohydrate Research Center, University of Georgia. 
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DEVELOPMENTAL BIOLOGY 


Led by the nose 


Hagfish embryos show developmental features that contradict the idea that these 
jawless fish are the most primitive living vertebrates. The findings also help to 
trace the evolution of vertebrate cranial structure. SEE ARTICLE P.175 


PHILIPPE JANVIER 


agfish are bizarre creatures. These 
Heiser marine fish have a skull 

but no vertebrae, and teeth but no 
jaw. These and other odd features have long 
fascinated evolutionary biologists, who regard 
the ‘primitive’ characteristics of hagfish as a 
treasure trove of clues to understanding the 
early evolution of vertebrate anatomy and 
physiology. But hagfish embryos are reput- 
edly scarce, so their development has remained 
an enigma. Six years ago, however, a team of 
Japanese researchers realized’ that hag- 
fish embryos are not particularly rare: they 
simply take more time to mature than other 
piscine vertebrates. This discovery triggered 
extensive investigations of hagfish embryos, 
some of which are published by Oisi et al.’ on 
page 175 of this issue*. The authors provide 
a spectacular overview of hagfish craniofacial 
development, with particular reference to the 
origins of the olfactory organs and pituitary 
gland, two key features in the shaping of the 
vertebrate skull’. 

Hagfish often feed on sunken carcasses, 
tearing off food using a peculiar tongue-like 
feeding device that is armed with sharp teeth 
made of keratin. They were first classified as 
cyclostomes, alongside lampreys, because 
both groups of jawless fish have similar feed- 
ing devices and pouch-shaped gills, and both 
lack a calcified skeleton. Cyclostomes were 
regarded as an early offshoot of the vertebrate 
evolutionary tree. Then, in the 1970s, hagfish 
were considered to be anatomically and physi- 
ologically more primitive than either lampreys 
or jawed vertebrates (gnathostomes), and to 
have diverged earlier than all other verte- 
brates. However, this hypothesis has since 
been dismissed on the basis of DNA sequence 
data that show a close phylogenetic relation- 
ship between lampreys and hagfish*”. But 
until recently, lamprey development had been 
studied much more than that of hagfish. 

Hagfish and lampreys have a single median 
(located in the midline of the body) ‘nostril’ 
which opens anteriorly in hagfish but dorsally 
in lampreys. This nostril is the entrance of a 
tube into which two organs open: a median 
olfactory organ and the adenohypophysis 
(also known as the anterior pituitary gland, 
which is part of the endocrine system). This 


*This article and the paper under discussion’ were 
published online on 19 December 2012. 


monorhinal (single nostril) condition differs 
from the diplorhinal (two nostrils) structure 
of the gnathostomes, in which two well-sep- 
arated olfactory organs open to the exterior 
environment through independent nostrils, as 
do human noses. The separation of the olfac- 
tory organs in gnathostomes is now regarded 
as a prerequisite to the development of jaws’, 
because it allowed the mandibular arch to 
extend forward. 

However, this difference in the structures 
poses a conundrum. The cyclostomes and 
gnathostomes are each monophyletic*”, 
meaning that they form a group compris- 
ing an ancestral species and all its descend- 
ants. Non-vertebrate members of the phylum 
Chordata (to which vertebrates also belong), 
such as amphioxus or tunicates (sea squirts), 
lack an indisputable homologue of the olfactory 
organs and adenohypophysis. This raises the 
question: what was the arrangement of these 
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organs in the most recent common ancestor to 
all vertebrates? To answer this, Oisi et al. stud- 
ied the embryonic tissues that give rise to the 
olfactory organs and adenohypophysis in 
hagfish, lampreys and gnathostomes. These 
tissues, called placodes, are thickenings of the 
embryonic ectoderm (the outer germ layer) 
that develop into various sensory organs, such 
as the olfactory organs, eye lens, otic capsules 
and the adenohypophysis. 

In the lamprey embryo, the olfactory and 
adenohypophyseal placodes are united into 
a single nasohypophyseal plate (NHP) that 
forms beneath the upper lip and gives rise to 
a nasohypophyseal complex, which migrates 
dorsally later in development (Fig. 1). Oisi 
and colleagues found that hagfish embryos 
also have an NHP, and that the gene-expres- 
sion pattern in both the lamprey and hagfish 
NHP is characteristic of ectoderm-associated 
tissues. These findings clearly refute previous 
speculation that the hagfish NHP derives from 
the endoderm (the inner germ layer that later 
forms the pharynx and digestive tract), which, 
if true, would have been the only case of this 
among vertebrates. 

But this reconciliation of the structure and 
position of the NHP in hagfish and lampreys 
still leaves two distinct conditions among 
vertebrates: cyclostomes have a median NHP, 
and gnathostomes have independent nasal 
and adenohypophyseal placodes (Fig. 1). 
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Figure 1 | Cranial anatomy of living and fossil vertebrates. Oisi et al.” show that the olfactory organs 
and anterior pituitary gland (adenohypophysis) of the two extant forms of jawless vertebrates, hagfish 
and lampreys (collectively referred to as the cyclostomes), develop in much the same way. Their embryos 
have median olfactory (red) and adenohypophyseal (blue) placodes (thickenings of the outer germ layer) 
that are united in a nasohypophyseal plate (NHP). By contrast, in jawed vertebrates (gnathostomes), 
these organs arise from separate tissues: a pair of olfactory placodes and a single adenohypophyseal 
placode. Although placodes are embryonic structures not seen in fossils, the skeletons of extinct jawless 
fish (ostracoderms), which are regarded as more closely related to gnathostomes than to cyclostomes, 
can help in reconstructing the steps towards gnathostome anatomy (the drawings are reconstructions 
from fossil evidence®). The anaspids and osteostracans, for example, display a small dorsal nostril similar 
to that of lampreys, whereas the heterostracans and galeaspids had paired olfactory organs and a large 
shared nostril. This diversity foreshadows the transition from single-nostrilled to paired-nostrilled forms, 
and the separation of olfactory and adenohypophyseal development during vertebrate evolution may 
have allowed the forward extension of the mandibular arch and hence the development of jaws. 
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Could one of these conditions be general — or 
‘ancestral’ — for vertebrates, and the other have 
subsequently derived from it? Oisi et al. suggest 
that the cyclostome condition (a median NHP 
is likely to be the earlier structure, because the 
monorhinal condition is found in the extinct 
jawless vertebrates commonly known as 
ostracoderms, which are between 480 million 
and 370 million years old and are regarded as 
more closely related to gnathostomes than to 
cyclostomes (Fig. 1). 

Placodes are embryonic structures that are 
not preserved in ostracoderm fossils, but the 
skeletons of these animals provide informa- 
tion about their differing cranial anatomy”. 
Ostracoderms display diverse arrangements 
of the cranial spaces that housed the olfac- 
tory organs and the adenohypophysis, but 
in all of them these spaces open towards a 
median nostril, as in cyclostomes. For exam- 
ple, anaspids and osteostracans have a median 
dorsal nasohypophyseal complex similar to 
that of lampreys, which suggests some kind of 
cyclostome-like NHP, although other features 
show that osteostracans are closer to gnatho- 
stomes than to any other ostracoderm. By con- 
trast, heterostracans and galeaspids had paired 
olfactory organs, still closely associated with 
the adenohypophysis in the latter, but open- 
ing anteriorly by a large median nostril and 
suggesting a broader NHP (Fig. 1). 

This anatomical diversity is puzzling, but 
Oisi and colleagues’ findings on hagfish 
development help to clarify the picture. The 
authors propose that the presence of the NHP 
in both hagfish and lamprey embryos sup- 
ports the idea ofa general, ‘pan-cyclostome’ 
pattern for craniofacial development, and 
that the anatomical characteristics that make 
present-day hagfish seem more primitive than 
lampreys are the result of a subsequent loss of 
features in hagfish during evolution. Such a 
pattern, combined with the fact that no NHP 
is seen during the development of any living 
gnathostomes, and the apparent progression 
from the monorhinal to diplorhinal condi- 
tion in ostracoderms, suggests that a crucial 
innovation in the development of the verte- 
brate snout was the loss of a common naso- 
hypophyseal duct and the rise of separate, 
paired nostrils. = 
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PHOTONICS 


Phased array on 


a fingertip 


An array of more than 4,000 optical antennas working in unison has been 
demonstrated on a millimetre-scale silicon chip. The result highlights the 
remarkable capabilities of optical integration in silicon. SEE LETTER P.195 


THOMAS F. KRAUSS 


Ferdinand Braun remarked’ that: “It had 

always seemed most desirable to me to 
transmit the waves, in the main, in one direc- 
tion only”. He was referring to radio waves, 
which had hitherto been known to radiate 
only in a doughnut shape away from the emit- 
ter, as described by Heinrich Hertz two dec- 
ades earlier. The invention for which Braun 
was recognized was the phased array — the 
idea of connecting up multiple antennas fed 
from a common source and using the rela- 
tionship between the antennas’ phase (where 
a wave's peaks and troughs lie) to enhance the 
emission of radio waves in a given direction. 
His work was instrumental in the develop- 
ment of radio transmitters, and led to other 
inventions such as the television antenna and 
radar. The phased-array concept is now also 
extensively used in astronomy, where the 
information from multiple telescopes is col- 
lected in phase to extend the resolution of the 
system. On page 195 of this issue, Sun et al.” 
introduce an integrated optical phased array 
that combines the equivalent of more than 
4,000 telescopes onto a single chip smaller 
than a fingernail. 

The creation of phased arrays in the opti- 
cal domain has been an active area of research 
for some time, as it is motivated by the need 
to improve applications such as optical beam 
steering and imaging. To achieve the highest 
performance, the challenges are to integrate a 
large number of antennas onto a single chip, to 
place them close together and to fully control 
the phase on each element. The short opti- 
cal wavelength makes it easy, in principle, to 
address the first challenge and to build large 
arrays on a small footprint in an integrated for- 
mat. The other two challenges are much more 
difficult to meet, however, because the opti- 
cal couplers that feed a controlled amount of 
power into each antenna and the phase shifters 
needed to control the phase on each element 
tend to be tens or hundreds of wavelengths 
long, which requires a large distance between 
each antenna. Therefore, the latest realizations 
of integrated optical phased arrays have been 
relatively large and have featured a maximum 
of only 4x4 antennas”. 


1E his Nobel lecture in 1909, physicist 
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The architecture now introduced by Sun 
et al.” addresses all of these requirements and 
is scalable. By carefully designing the optical 
coupler, phase shifter and gratings that act as 
radiating antennas, the team has managed 
to produce a phased array of 64 x 64 antenna 
units, or ‘pixels’ on a footprint of a little more 
than 0.576 x 0.576 millimetres, with each pixel 
covering 9 x 9 micrometres — equivalent to a 
pixel side length of six optical wavelengths. 
Most impressive are the accuracy and control 
that the researchers were able to exercise over 
the array’s optical functions, especially those 
of the optical couplers. 

The device's accuracy and performance are 
highlighted by the well-balanced power output 
achieved across the array. Equally, the fact that 
a phase shift of 7 radians can be attained on 
a pixel size a few wavelengths across (in con- 
trast to the tens or hundreds of wavelengths 
previously required), and with relatively low 
power and crosstalk, points to the remarkable 
capabilities of nanophotonic concepts that 
are now available. Fabrication of the array, as 
a photonic circuit, using only silicon micro- 
electronics processing techniques — a concept 
pioneered’ by the IMEC microelectronics cen- 
tre in Leuven, Belgium — is also a testimony 
to the high performance that can be achieved 
by using silicon as a photonic material. The 
successful demonstration of the authors’ 
array shows that the large-scale integration 
of photonic circuitry in silicon has well and 
truly arrived. 

So where do we go from here? One inter- 
esting idea is that of imaging through light- 
scattering media’, such as roughened glass or 
biological tissue. Such imaging uses the adap- 
tive-optics principle of adjusting the phase of 
an optical wave locally so as to compensate 
for the distortion caused by the medium. 
The authors’ optical phased array now makes 
available the phase control and large pixel 
number that this application requires. To 
achieve imaging through time-varying light 
scatterers, such as turbulent liquids, a fast 
time-response of the system is also essential; 
the small and efficient tuning of each antenna 
demonstrated by Sun et al. can also meet this 
requirement. Another application is in sensor 
networks, in which communication between 
randomly distributed sensors is based on 
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Flows of gas through a protoplanetary gap 


Simon Casassus', Gerrit van der Plas!, Sebastian Perez M!, William R. F. Dent”’, Ed Fomalont’, Janis Hagelberg”, Antonio Hales?*, 
Andrés Jordan®, Dimitri Mawet?, Francois Ménard’’*, Al Wootten’, David Wilner’, A. Meredith Hughes’®, Matthias R. Schreiber", 
Julien H. Girard’, Barbara Ercolano'*, Hector Canovas", Pablo E. Roman!’ & Vachail Salinas! 


The formation of gaseous giant planets is thought to occur in the 
first few million years after stellar birth. Models’ predict that the 
process produces a deep gap in the dust component (shallower in 
the gas” *). Infrared observations of the disk around the young star 
HD 142527 (at a distance of about 140 parsecs from Earth) found 
an inner disk about 10 astronomical units (AU) in radius? (1 AU is 
the Earth-Sun distance), surrounded by a particularly large gap® 
and a disrupted’ outer disk beyond 140 av. This disruption is indi- 
cative of a perturbing planetary-mass body at about 90 au. Radio 
observations*” indicate that the bulk mass is molecular and lies 
in the outer disk, whose continuum emission has a horseshoe 
morphology’*. The high stellar accretion rate’? would deplete the 
inner disk" in less than one year, and to sustain the observed 
accretion matter must therefore flow from the outer disk and cross 
the gap. In dynamical models, the putative protoplanets channel 
outer-disk material into gap-crossing bridges that feed stellar 
accretion through the inner disk’”. Here we report observations 
of diffuse CO gas inside the gap, with denser HCO® gas along gap- 
crossing filaments. The estimated flow rate of the gas is in the range 
of 7 X 10~° to 2 X 10” solar masses per year, which is sufficient to 
maintain accretion onto the star at the present rate. 

The HD 142527 system offers an unhindered view of its large central 
gap, and is a promising ‘laboratory’ in which to observe the ongoing 
formation of gaseous giant planets. The orientation of the disk is well 
understood. Multiwavelength data are consistent with an inclination 
of about 20°, indicating that the disk is almost face-on to our line of 
sight'’. The disk position angle is about —20° east of north, and the 
eastern side is the far side of the disk, as suggested by a clear view of the 
outer disk’s inner rim in the mid-infrared spectral range’*” and by a 
clockwise rotation suggested by what is probably a trailing spiral arm 
to the west®. 

We find that the CO(3-2) emission (that is, emission of CO with 
J = 3-2, where J is the CO rotational quantum number) peaks inside 
the gap. Other disks have been observed to have a CO decrement 
within dust cavities'*'*, and may represent later evolutionary stages 
or different gap-clearing mechanisms. Gas inside dust cavities has 
previously been directly observed very close to the central star (inside 
the dust evaporation radius) using near-infrared interferometry'*”. 
Other indirect observations of gas inside’”*’ dust gaps at larger dis- 
tances from the central star have revealed spectroscopically resolved 
gas tracers, such as rovibrational CO emission at a wavelength of 
4.67 um and [O1] emission at 6,300 A, under the assumption of 
azimuthal symmetry and Keplerian rotation. Spectro-astrometry in 
combination with Keplerian disk models and azimuthal symmetry 
have also been used to infer the presence of CO gas inside disk 
gaps**°. Our data provide a well-resolved observation of gas at sub- 
millimetre wavelengths inside a dust gap. 


The dust gap that we see in the radio continuum (Fig. 1a) indicates 
a decrease by a factor of at least 300 in the surface density of millimetre- 
sized grains, judging from the contrast ratio between the peak on the 
horseshoe-shaped outer disk (the northern flux peak at 360 mJy per 
beam) and the faintest detected signal inside the gap (namely the 
western filament at 1 mJy per beam; see below). Yet there is no 
counterpart in the CO(3-2) map (Fig. 1b) of the arc that we see in 
the radio continuum. CO(3-2) is probably optically thick, as reflected 
by its diffuse morphology, so that it traces the temperature profile 
rather than the underlying density field. Detailed modelling of optical- 
ly thin isotopologue data is required to constrain accurately the depth 
of the gaseous gap. To study the distribution of dense gas inside the 
gap, we use the tracer HCO". 

The second result from our observations is that gas showing 
HCO*(4-3) emission, expected in the denser regions (molecular 
hydrogen number density, my, ~ 10°cm *) exposed to ultraviolet 
radiation, is indeed found in the exposed rim of the dense outer disk, 
but also along gap-crossing filaments. The most conspicuous filament 
extends eastwards from the star, and a fainter filament extends west- 
wards. Both filaments subtend an angle of about 140 + 10° with the 
star at its vertex. The central regions of these filaments correspond 
to the brightest features in the HCO” line intensity maps (Fig. 1c), 
although the outer disk is brighter in peak specific intensity (Sup- 
plementary Fig. 7). Thus, line velocity profiles are broader on the stellar 
side of the filaments than in the outer disk, where they merge with the 
outer-disk Keplerian rotation pattern. These narrow, systemic-velocity 
HCO* filaments are best seen in intensity maps integrated over 
the filament velocities (Fig. 1d, inset). No central peak is seen in the 
channel maps (Supplementary Fig. 2), and so a beam-elongation effect 
can be ruled out. The eastern filament also is notable in terms of peak 
HCO" specific intensity (Supplementary Fig. 7e). For ease of visuali- 
zation, we show deconvolved models of the HCO™ intensity images in 
the inset to Fig. 1d. A related feature is seen in CO(3-2) emission, 
whose intensity peaks in the more diffuse regions surrounding the 
eastern HCO* filament. We note from the inset to Fig. 1 that the 
continuum also shows features under the HCO* filament. These fea- 
tures are faint and grow away from the edges of the horseshoe-shaped 
outer disk. Estimates of physical conditions are given in Supplemen- 
tary Information. 

The molecular and filamentary flows near the star are non- 
Keplerian. Blueshifted emission extends to the east from the central 
intensity peak (Fig. 1d). This velocity componentis broad near the star, 
with emission ranging from —3.4 to +11kms ‘ (Supplementary 
Fig. 2), and is marginally resolved (the central HCO* peak extends 
over about 0.65 X 0.38 arcsec”). In the deconvolved images of the inset 
to Fig. 1d, the peak intensity in the blue- and red-outlined regions are 
separated by about 0.2 arcsec, that is, by the diameter of the inner disk, 
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Figure 1 | ALMA observations of HD 142527, with a horseshoe dust 
continuum surrounding a gap that still contains gas. We see diffuse CO gas 
in Keplerian rotation (coded in Doppler-shifted colours), and filamentary 
emission in HCO™, with non-Keplerian flows near the star (comparison 
models illustrative of Keplerian rotation are shown in Supplementary 
Information). The near-infrared emission abuts the inner rim of the horseshoe- 
shaped outer disk. The star is at the origin of the coordinates. North is up and 
east is to the left. a, Continuum at 345 GHz, with specific intensity units in 
Janskys per beam. It is shown on an exponential scale (colour scale). A beam 
ellipse is shown in the bottom right ofb, and contours are drawn at 0.01, 0.1, 0.3, 
0.5, 0.75 and 0.9 times the peak value. The noise level is lo = 0.5 mJy per beam. 
b, CO(3-2) line intensity shown by white contours at fractions of 0.3, 0.5, 0.75 
and 0.95 of the peak intensity value, 2.325 X 10 ~° W per beam. The underlying 
red—green—-blue image also shows CO(3-2) line intensity, but integrated in 
three different velocity bands, whose velocity limits are indicated in the spectra 
of Supplementary Fig. 8. c, Near-infrared image from Gemini that traces 
reflected stellar light, shown on a linear scale. We applied a circular intensity 
mask to the stellar glare, some of which immediately surrounds the mask. See 


and at a position angle orthogonal to that expected from close-in high- 
velocity material in Keplerian rotation. Very blueshifted emission 
could reach out to 0.2 arcsec from the star (channel at —2.4kms | 
in Supplementary Fig. 2, taking into account the beam). A blueshifted 
CO(3-2) high-velocity component can also be seen at the base of this 
feature, near the star (at —2.1kms ‘ in Supplementary Fig. 5). 

The non-Keplerian HCO* is probably not consistent with a central 
outflow. Stellar outflows are not observed” in disks with inner cavities 
and no molecular envelopes (that is, transition disks). For an outflow 
orientation, the low velocities measured by the lines imply that the 
filaments in HD 142527 would stand still and hover above the star 
(Supplementary Information, section 3). Even the blueshifted emis- 
sion is slow by comparison with the escape velocity. A slow disk wind 
(for example one photoevaporative or magnetically driven) can also be 
excluded on the basis of the high collimation shown by the HCOT 
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Supplementary Fig. 1 for an overlay with the continuum. d, HCO* (4-3) line 
intensity shown by white contours at fractions of 0.1, 0.3, 0.5, 0.75 and 

0.95 of the peak intensity value, 0.40 x 10° 7°w per beam, overlaid on a 
red-green-blue image of HCO™ intensity summed in three different colour 
bands (see Supplementary Fig. 8 for definitions). Insets show magnified views 
of the central features that cross the dust gap. The cross indicates the star at the 
origin, with a precision of 0.05 arcsec, and the arrows point at the filaments. 
Inset to a: same as in a, with a narrow exponential scale highlighting the 
gap-crossing filaments. These features appear to grow from the eastern and 
western sides of the horseshoe. Contours are at 0.0015 and 0.005 Janskys per 
beam. Inset to d: deconvolved models (Supplementary Information) of the 
HCO* emission (green) at velocities where the gap-crossing filaments are seen, 
that is, from 3.2 to 4.3kms '. Intensity maps for the blue and red velocity 
ranges (see Supplementary Fig. 8 for definitions) are shown in contours, with 
levels at 0.5 and 0.95 times the peak values. These red and blue contours are an 
alternative way to present the intensity field shown in d, but deconvolved for 
ease of visualization. 


emission. Indeed, the CO 4.67-|1m emission seen in the inner disk’? is 
purely Keplerian, it does not bear the signature of the disk winds seen 
in other systems and its orientation is the same as that of the outer 
disk. An orthogonal inner disk can also be discounted on dynamical 
grounds (Supplementary Information, section 3). 

It is natural to interpret the filaments as planet-induced gap- 
crossing accretion flows, or ‘bridges’. Because the eastern side is the 
far side, the blueshifted part of the eastern bridge is directed towards 
the star and is a high-velocity termination of the accretion flow onto 
the inner disk. These bridges are predicted by hydrodynamical simula- 
tions when applied to planet-formation feedback in HD 142527 (ref. 7). 
In these simulations, the bridges straddle the protoplanets responsible 
for the dynamical clearing of the large gap in HD 142527. They are 
close to Keplerian rotation in azimuth, but have radial velocity com- 
ponents of 2 0.1 of the azimuthal components. In our data, we see that 
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as the bridges contact the inner disk they also coincide with higher- 
velocity material, at two-dimensional radial velocities less than their 
azimuthal velocities in the plane of the sky. 

An interesting comparison object is the GG Tau circumbinary disk. 
A submillimetre continuum accretion stream”* is seen to cross the gap 
that surrounds the central binary in GGTau, with indications of 
shocked infrared molecular gas in the inner disk”. The angular radii 
of the rings in GG Tau and in HD 142527 are very similar, as are 
the morphologies of the GG Tau streamer and the eastern filament 
(although it is fainter relative to the outer disk in HD 142527). 
However, the GG Tau binary has a mass ratio of about 1, a separation 
of 0.25 arcsec and is aligned along the streamer, whereas in HD 142527 
no stellar companion has been detected (see Supplementary Infor- 
mation for limits; mass ratios are >10 at 0.088 arcsec). The putative 
companions responsible for the streams in HD 142527 are of much 
lower mass than in GG Tau. 

We performed high-contrast infrared imaging to attempt the detec- 
tion of the possible accreting protoplanets that would be expected if the 
gap-crossing bridges observed in HCO” are indeed planet-induced 
gap-crossing accretion flows. Neglecting extinction, we could virtually 
rule out any companion more than about four times as massive as 
Jupiter, at a separation of 0.3-2.5 arcsec (Supplementary Information, 
section 4, and Supplementary Fig. 13, which also concerns the lack of 
close stellar companions). However, according to the hydrodynamical 
simulations the channelling protoplanets should be located inside 
the gap-crossing bridges. Our estimates for the hydrogen column 
number density, Ny, along the bridges correspond to a broad range 
of high visual (~5,500 A) extinction values, 5 mag < Ay $50 mag, for 
standard dust abundances. Any protoplanets embedded inside the 
bridges, and certainly those embedded in the dense horseshoe struc- 
ture, will be obscured, and our mass limits will be correspondingly 
increased. 

As well as the diffuse gas inside the gap and the gap-crossing fila- 
ments, a third feature of our observations is the horseshoe shape of the 
continuum, which was seen previously by the Submillimeter Array’, 
Hawaii, but at coarser resolutions, and whose origin is still unclear. The 
millimetre continuum traces the total dust mass, so the north-south 
specific intensity ratio of 28 + 0.5 reflects the underlying dust mass 
asymmetry. At its peak, the continuum may even be optically thick, as 
it coincides with a decrease in the HCO* emission. For a constant gas- 
to-dust mass ratio, such horseshoe-shaped mass asymmetries arise in 
models of planet-induced dynamical clearing. In general, these horse- 
shoes can be produced by Rossby-wave instabilities, which are seen in 
high-resolution, three-dimensional simulations at the edge of sharp 
density gradients (P. Varniere, personal communication). However, 
horseshoes have also been modelled in the context of large-scale 
vortices induced by sharp viscosity gradients”. 

Another interpretation of the horseshoe continuum is that it results 
from a varying dust-to-gas ratio and azimuthal grain-size segregation. 
By contrast to the continuum, the outer disk is seen as a whole ring in 
HCO* (Supplementary Fig. 7), which is a tracer of dense gas, and is 
probably optically thick along the ring. A rarefaction of millimetre- 
sized dust grains to the south could perhaps explain the lack of a 
submillimetre continuum. Only small dust grains would be found 
in the south. These small grains efficiently scatter the near-infrared 
light seen in Fig. Ic, filling the opening of the horseshoe. However, 
azimuthal dust segregation has been predicted for centimetre-sized 
grains at co-rotation in two-fluid simulations of gap clearing by 
giant-planet formation’, whereas millimetre-sized grains remain 
relatively unaffected. We explain in Supplementary Information why 
differential stellar heating cannot account for the observed north- 
south contrast. 

The filamentary flows and the residual gas inside the gap are in 
qualitative agreement with planet-formation feedback on the parent 
disk, which feedback carves a gap in the dust distribution while still 
feeding stellar accretion through gap-crossing accretion streams. As 
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detailed in Supplementary Information, the observed inflow velocity, 
together with the critical density of the molecular tracer and the section 
of the filaments, provide a lower bound to the mass inflow rate of 
7x10~°Mo yr~!. An upper bound of 2x 10~’Mo yr! can be 
estimated from the continuum mass in the filaments and their 
kinematic timescale. These estimates for the mass inflow rate are close 
to the observed stellar accretion rate’, of 7x 10~8Mo yr~', giving 
quantitative support to our suggestion that the HCO™ filaments 
are inflows. 
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How insulin engages its primary binding site on the 


insulin receptor 
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Insulin receptor signalling has a central role in mammalian biology, 
regulating cellular metabolism, growth, division, differentiation 
and survival’’. Insulin resistance contributes to the pathogenesis 
of type 2 diabetes mellitus and the onset of Alzheimer’s disease’; 
aberrant signalling occurs in diverse cancers, exacerbated by cross- 
talk with the homologous type 1 insulin-like growth factor receptor 
(IGF1R)*. Despite more than three decades of investigation, the 
three-dimensional structure of the insulin-insulin receptor complex 
has proved elusive, confounded by the complexity of producing the 
receptor protein. Here we present the first view, to our knowledge, of 
the interaction of insulin with its primary binding site on the insulin 
receptor, on the basis of four crystal structures of insulin bound to 
truncated insulin receptor constructs. The direct interaction of insu- 
lin with the first leucine-rich-repeat domain (L1) of insulin receptor is 
seen to be sparse, the hormone instead engaging the insulin receptor 
carboxy-terminal o-chain (aCT) segment, which is itself remodelled 
on the face of L1 upon insulin binding. Contact between insulin and 
LI is restricted to insulin B-chain residues. The aCT segment dis- 
places the B-chain C-terminal f-strand away from the hormone core, 
revealing the mechanism of a long-proposed conformational switch in 
insulin upon receptor engagement. This mode of hormone-receptor 
recognition is novel within the broader family of receptor tyrosine 
kinases’. We support these findings by photo-crosslinking data that 
place the suggested interactions into the context of the holoreceptor 
and by isothermal titration calorimetry data that dissect the hormone- 
insulin receptor interface. Together, our findings provide an explana- 
tion for a wealth of biochemical data from the insulin receptor and 
IGFIR systems relevant to the design of therapeutic insulin analogues. 

Insulin comprises two chains (A and B) containing three «-helices 
(residues Al-A8, A12-A18 and B9-B19) constrained by one intra- 
and two interchain disulphide bonds° (Fig. 1a). The insulin receptor is 
a disulphide-linked («%8). homodimer; the extracellular portion of 
each «f protomer contains six domains (L1, CR, L2, FnIII-1, FnIII-2 
and FnIII-3) and an insert domain (ID) within FnIII-27 (Fig. 1b). The 
a-chain component of the ID is terminated by a segment termed «CT, 
spanning residues 704-719 (using the numbering of the insulin 
receptor exon 11— isoform*). Two surfaces of insulin are understood 
to interact with the insulin receptor”"®. The first consists predomi- 
nantly of hormone-dimerizing residues and contacts the primary 
binding site on the receptor (site 1; dissociation constant (Ka) 
~6.4nM) comprising the «CT segment from one insulin receptor 
oa-chain and the central B-sheet of L1 (L1-f2) of the other o-chain 
within the insulin receptor dimer''"* (Supplementary Fig. 1a). The 
second consists predominantly of hormone-hexamerizing residues 
and is proposed to interact with a secondary insulin receptor site (site 2; 
Ka ~400nM) at the junction of FnIII-1 and FnlII-2 of the insulin 
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receptor o-chain opposite to that contributing L1 to site 1 (Sup- 
plementary Fig. la)’?"'»"». Structural study of the insulin receptor 
and IGFIR ectodomains is difficult; their large size, flexible multido- 
main morphology and extensive glycosylation and disulphide bonding 
posing challenges in production and crystallization. Only three ecto- 
domain structures exist: those of the apo-insulin receptor”'* and the 
L1-CR-L2 fragments of insulin receptor'® and IGFIR” (3.7 A, 2.3A 
and 2.6 A resolution, respectively); none is in complex with ligand. 

We present here the first view of the insulin-site 1 interaction on the 
basis of four crystal structures (complexes A, B, C and D) varyingly 
using the L1-CR construct IR310.T (Fig. lc) and the L1-CR-L2- 
(FnIII-1)-«CT(704-719) homodimeric construct IR593.«CT (Fig. 1d), 
which has «CT(704-719) linked to FnIH-1. Complex A (3.9 A reso- 
lution) contains human-insulin-bound IR310.T in complex with a 
Fab fragment from monoclonal antibody 83-7 (ref. 7) and exogen- 
ous “CT (704-719) peptide. Complex B (4.3 A resolution) is the same 
as complex A but uses the high-affinity, truncated insulin analogue [D- 
Pro®”°]-DTI-NH,!*. Complex C (4.4 A resolution) is the same as com- 
plex B but uses the longer peptide «CT(697-719). Complex D (4.4 A 
resolution) contains bovine insulin bound to IR593.«CT in complex 
with a Fab fragment from monoclonal antibody 83-14 (ref. 7). 

The limited diffraction of these crystals necessitated the use of the 
following: (1) weak data at the diffraction limit (a process validated by 
the half-data set Pearson correlation coefficient CC, criterion’®”° and 
ultimately by the evident quality (see below) of the resultant electron- 
density difference maps (EDDMs)); (2) B-factor-sharpened EDDMs 
in model building”'; and (3) dynamic ‘reference-structure’ and ‘jelly- 
body’ restraints during individual atomic-coordinate refinement 
whereby we exploited the known higher-resolution structures of the 
underlying fragments”. Molecular replacement located the insulin 
receptor and Fab domains and generated a clear solution for the 
insulin three-helix ‘core’ structure within each complex, positioning 
the core identically on the respective L1-B, surfaces. The strongest 
feature in the resultant EDDMs was a common, helical-like structure 
lying adjacent to the insulin B-helix on the L1-f, surface that, upon 
B-factor sharpening, exhibited side-chain density of distinctive sizes 
and at spacing consistent with an o-helix. In complex D, this feature 
was connected to the C-terminal strand of FnIII-1, allowing its unam- 
biguous interpretation as xCT and restricting its prima facie sequence 
register to an ambiguity of at most one residue unit, with one alter- 
native strongly favoured. We confirmed our register assignment 
objectively by devising a broad-window analysis of sequence fit 
both to the higher-resolution EDDM of complex A and to the profiled 
residue environments within that complex. Crystallographic pro- 
cessing and refinement statistics are provided in Table 1 and Sup- 
plementary Table 1, with further detail in Methods. 
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Figure 1 | Structure of insulin, insulin receptor and the site 1 complexes. 
a, Insulin. InsA, A chain; InsB, B chain. b, Insulin receptor. CR, Cys-rich 
domain; FnIII-1, FnIII-2, FnIII-3, first, second and third fibronectin type III 
domains; ID, insert domain; L1, L2, first and second leucine-rich-repeat 
domains; TK, tyrosine kinase; TM, JM, transmembrane and juxtamembrane 
segments. c, d, IR310.T and IR593.0CT domain structure, respectively. 


The structures of the insulin-bound site 1 assembly within com- 
plexes A and D are shown in Fig. le, f, respectively; those within 
complexes B and C are effectively isomorphous with complex A 
(Fig. 1g). The Fab-complexed forms of the IR310.T and IR593.«CT 
monomers are shown in Supplementary Fig. 2a, b, respectively. As 
prepared and crystallized, complex D contains a dimer of IR593.«CT 
homodimers crosslinked by four insulins; further description of this 
assembly is provided in Supplementary Fig. 2c-e. Despite the non- 
native connection of «CT to FnIII-1 in IR593.«CT (Fig. 1d), the insu- 
lin-bound site 1 assembly within complex D is also superimposable on 
that of complex A (Fig. 1g). Refinement of these complexes provided 
EDDMs that revealed side-chain bulk for all components within the 
observed site 1 interface (Fig. 1h). The mode of insulin-site 1 inter- 
action emerging from these structures is as follows. The insulin B-helix 
(B7-B21) engages the C-terminal end of the L1-f, strands; the insulin 
A-chain has no interaction with L1 (Fig. le, f). Both chains interact 


Table 1 | X-ray diffraction data collection statistics 
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e, f, Insulin-bound site 1 in complexes A and D, respectively. Blue, red spheres, 
observed chain termini; orange sphere, FnIII-1-«CT junction. g, Overlay of 
insulin-bound site 1 in complexes A (coloured as in e), B (red), C (green) and D 
(white). h, Sample 2Fop; — Fraie map volumes (Byharp = — 160 A’; contours = 
1.1-1.50) for complex A. 


extensively with «CT. Notably, the «CT helix is, with respect to its 
apo-insulin receptor counterpart, both displaced on the L1-f, surface 
and C-terminally extended to include residues 711-715 (see Fig. 2a and 
further discussion below). Residues B22-B30 are not resolved in these 
structures, but, critically, the «CT helix occupies volume that would 
otherwise contain insulin residues B26-B30 if the latter retained their 
receptor-free conformation (see Fig. 2b and Supplementary Fig. 3a). 
Displacement of residues B26-B30 from the insulin core confirms a 
long-speculated™ induced fit upon insulin receptor binding (Sup- 
plementary Fig. 1b). In contrast, the helical core of insulin seems, as 
far as can be discerned at the described resolution, closely similar to 
that of uncomplexed hormone (Supplementary Fig. 3b). The models 
suggest that the two most critical hormone-engaging residues in «CT 
are (1) His 710, which inserts into a pocket formed by invariant’ 
insulin residues Val’, Gly® 8, Ser®? and Val®”; and (2) Phe 714, which 
occupies a hydrophobic crevice formed by invariant insulin residues 


Complex A, data set 1 Complex A, data set 2 


Complex B Complex C Complex D 


Human insulin + IR310.T- 
83-7 + «CT(704-719) 


Human insulin + IR310.T- 
83-7 + aCT(704-7 19) 


[p-Pro®?°]-DTI-NH» + IR310.T- 


Bovine insulin + 
IR593.0CT/83-14 


[p-Pro®?°]-DTI-NH2 + IR310.T- 


83-7 + oCT(704-7 19) 83-7 + oCT(697-719) 


Space group P23 P23 
Cell dimensions 
a, b,c (A) 168.15, 168.15,168.15 168.91, 168.91, 168.91 
a, By °) 90, 90, 90 90, 90, 90 
Completeness (%) 99.8 (99.8)* 98.3 (99.5) 
Resolution (A) 59.5-4.0 (4.1-4.0) 46.8-3.9 (4.0-3.9) 
merge 0.156 (2.36) 0.089 (1.57) 
$Roim 0.039 (0.57) 0.048 (0.83) 
ol 11.5 (1.6) 9.0 (1.0) 
Redundancy 17.3 (17.4) 4.1 (4.2) 
8CCi/2 0.998 (0.580) 0.998 (0.342) 


P23 P23 12 
169.49, 169.49, 169.49 169.23, 169.23, 169.23 118.15, 140.10, 190.02 
90, 90, 90 90, 90, 90 90, 95.04, 90 
99.7 (100.0) 98.5 (98.1) 87.7 (84.9)+ 
29.6-4.3 (4.53-4.3) 29.5-4.30 (4.54-4.3) 56.3-4.4 (4.5-4.4) 
0.188 (2.69) 0.101 (1.09) 0.206 (1.31) 
0.067 (0.97) 0.053 (0.59) 0.143 (0.89) 
7.8 (1.0) 8.2 (1.4) 2.8 (0.8) 
8.8 (8.8) 4.4 (4.3) 2.9 (2.8) 
0.996 (0.330) 0.997 (0.438) 0.980 (0.232) 


Each data set was collected from a single crystal. 

* Highest-resolution shell statistics are shown in parentheses. 
+ Data completeness was compromised by radiation damage. 
$Rpim, precision indicating merging R-factor®°. 


§ CCy/2, Pearson correlation coefficient between independently merged halves of the data set’®. Highest-resolution shell CCy,2 values are significant at at least P= 0.001. 


242 | NATURE | VOL 493 | 10 JANUARY 2013 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


a c 

Asn 711 ka. 

Asp 707 

A 
/ / InsB 
so 
L1 domain 
L1 domain 

b InsA 


(free) Phe®24 


rs 
o™ L1 domain 


Figure 2 | The insulin-site 1 interaction. a, Altered disposition of «CT with 
respect to that in apo-insulin receptor’’. b, Superposition (via the A- and 
B-chain helices) of receptor-free insulin onto the insulin-site 1 complex, 
indicating steric clash of the B-chain C-terminal segment (green) with «CT. See 
also Supplementary Fig. 3a. c, Interaction (at the bulk side-chain level of detail) 


Gly*’, Tle*?, i Leu®"!, Val®* and Leu®!> (Fig. 2c). Asn 711 is 
directed towards Gly*!, Val“? and Glu“*. The hydrophobic face of 
aCT comprising Phe705, Tyr 708, Leu709, Val712 and Val713 
engages a non-polar groove on L1-f, formed by Leu36, Leu37, 
Leu 62, Phe 64, Phe 88, Phe 89, Val94 and Phe 96 (Fig. 2d). Direct 
interaction between the insulin B-chain B9-B19 helix and L1 seems 
to be limited: Val®!? is positioned between Phe 39, Phe 64 and Arg 65, 
whereas Tyr®!® adjoins Phe39 (Fig. 2e). Detailed rotameric and 
atomic level detail of the above interactions is precluded by the 
limited diffraction. 

Relocation of CT upon hormone binding is supported by recent 
studies indicating that residues buried beneath «CT in the apo- 
ectodomain may directly contact insulin’’, and that peptide substitu- 
tions in the xCT segment upstream of Phe 705 that increase the affinity 
of the segment for L1 concomitantly impair insulin binding’. To 
investigate further, we used complex C to test whether the relocation 
observed in complexes A, B and D is a consequence of xCT truncation, 
as in the apo-insulin receptor the visible interaction of wCT with L1-B, 
begins at Glu697. It is not—the relocation persists in complex C 
(Fig. 1g) with no detectable ordering of residues 697-704. 

We validated our interpretation of the «CT engagement with L1-B. 
and insulin further by isothermal titration calorimetry (ITC). First, 
Ala-substituted «CT(704-719) peptides were titrated against the L1- 
CR-L2 construct IR485 (ref. 16). Second, insulin was titrated against 
IR485 pre-saturated with Ala-substituted «CT(704-719) (Supplemen- 
tary Tables 2 and 3 and Supplementary Fig. 4). Individual F705A and 
L709A substitutions significantly impaired «CT binding to IR485, 
supporting their modelled engagement with L1-y. In contrast, indi- 
vidual H710A, N711A and F714<A substitutions significantly impaired 
insulin binding, supporting their modelled engagement with insulin. 
These data are summarized schematically in Fig. 3a. An apparent 
exception is V713A, which affects insulin but not «CT affinity, des- 
pite Val713 being modelled as engaged with L1-f,. However, this 


L1 domain 


L1 domain 


between %CT(704-719) and insulin. View direction is parallel to LI1-B2 surface 
(cyan); white shading shows insulin surface. d, e, Interaction (at the bulk side- 
chain level of detail) between L1 and «CT and between L1 and insulin B-chain 
helix, respectively. Panels based on complex A. 
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Figure 3 | Insulin interactions in L1-CR-L2 mini-insulin receptor and 
holo-insulin receptor. a, Helical wheel representation of ITC-derived insulin 
affinities for IR485 in the presence of Ala-substituted «CT(704-719) (red, 
>100X reduction upon Ala substitution; green, >10* reduction; grey, 
<10X reduction; open circle, not determined). b, Reducing gel autoradiograms 
obtained from holo-insulin receptor after photo-crosslinking of «CT helix to 
bound 7 [Tyr8"4] -insulin. Arrowed band indicates crosslinked insulin 
receptor o-chain-insulin A chain. Colours indicate crosslinking efficiency (red, 
strong; green, medium; blue, weak; grey, none). c, d, Qualitative crosslinking 
efficiency from b mapped onto «CT segment within the site 1 complexes. 
e, Helical wheel representation of crosslinking data presented in b-d. 
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observation aligns with the disorder of Val713 in the apo-insulin 
receptor ectodomain structure’; the interaction of Val 713 with L1- 
B, may thus instead be required to stabilize the critical'' insulin- 
engaging residue Phe 714 at the terminus of the «CT helix. 

The insulin-site 1 interactions described here are further supported 
by a ‘photo-scan’ of the holo-insulin receptor «CT segment, wherein 
individual «CT residues were successively substituted by the photo- 
activatable residue para-azido-Phe (Pap) and then exposed to ultra- 
violet light to probe covalent interaction with ['°I]-Tyr*"* insulin’. 
Strong crosslinking was observed for photo-probes at sites 711 and 
714, medium at sites 707 and 710, and weak at sites 713 and 715; no 
crosslinking was observed at sites 705, 706, 708, 709 or 712 (Fig. 3b). 
This photo-crosslinking efficiency pattern in holo-insulin receptor 
exhibits a marked correlation with and supports the insulin-site 1 
juxtaposition described above (Fig. 3c-e). 

Complexes B and C probed whether the stable B-turn at B24-B26 in 
receptor-free [D-Pro®*°]-DTI-NH,"* might result in ordering of 
residues B22-B26 upon binding site 1—it did not, as these residues 
remained unresolved. We investigated the insulin B-chain C-terminal 
strand further by seeking to photo-crosslink individual Pap probes at 
B24 and B25 to the truncated L1-CR-L2 construct IR468’° in the 
presence of an «CT peptide. In both holo- and ectodomain insulin 
receptor complexes, these probes respectively crosslink to L1 and 
aCT?”**, We observed photo-crosslinking for Pap*’, Pap“* and 
Pap®'® (as in holo-insulin receptor’”) but not for Pap?** or Pap®”° 
(Supplementary Fig. 5). We thus propose that full stabilization of 
B22-B30 upon detachment from the hormone core requires the 
presence of the FnIII domains. 

Superposition of the site 1 complexed assembly onto the L1 domain 
in the apo-insulin receptor ectodomain structure (Supplementary 
Fig. 6) shows that the insulin surface proposed” to bind site 2 overlays 
the FnIII-1-FnlII-2 junction of the alternative «f protomer, that is, 
the proposed site 2 itself. However, significant steric clash is present, 
indicating that insulin bridging of sites 1 and 2 requires relative 
displacement of the FnIII-1 and FnIII-2 domains away from the 
L1-CR module. Such change would alter the relative location of the 
membrane-proximal FnIII-3 domains, potentially initiating signal 
transduction’. 

Concordance of our structures with the extensive biochemical 
literature of this system is presented in Supplementary Discussion. 
Briefly, the structures are consistent with alanine scanning mutagen- 
esis of the insulin receptor L1 domain, revealing now that of the ‘hot- 
spot’ residues, the set Gln 34, Leu 36, Leu 37, Phe 64, Phe 88, Phe 89 
and Glu 120 contact the «CT segment whereas only Phe 39 contacts 
insulin. The structural engagement of insulin receptor residues His 710 
and Phe 714 with insulin concords with (1) alanine scanning mutage- 
nesis of the insulin receptor ectodomain; (2) hydrogen/deuterium 
exchange studies of the closely related type 1 insulin-like growth factor 
receptor (IGF1R); and (3) the lack of hormone binding by insulin 
receptor chimaeras and IGFIR chimaeras containing the «CT seg- 
ment from the insulin-receptor-related receptor, which has threonine 
at the residue equivalent to insulin receptor Phe 714 (IGF1R Phe 701). 
Our structures are also consistent with mutational analysis of insulin, 
showing now that the hotspot residues Gly“, Tle, Val, Tyr’, 
Gly®* and Leu®!’ make contact with the key «CT residues His710 
and Phe714 (Fig. 2c). Although residues Gly®”*, Phe®** and Phe"? 
are unresolved here, evidence is presented for the likely repositioning 
of the B-chain C terminus into the volume between the «CT segment, 
the L1-B, surface and the adjacent CR domain; such repositioning 
rationalizes the enhanced affinity of analogues containing D-amino- 
acid substitutions at B24. 

Our structural and biochemical findings provide groundbreaking 
insight into the primary insulin-insulin receptor interaction. Parti- 
cularly important in binding is the «CT segment, which is without 
known counterpart in other mammalian receptor tyrosine kinases’, 
although it is conserved among metazoan insulin receptor and 
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IGFIR. The concomitant structural re-arrangements of hormone 
and receptor in our model rationalize long-standing anomalies in 
clinical insulin variants associated with diabetes” and define new 
structural targets for novel therapeutics targeting these receptors. 


METHODS SUMMARY 


IR310.T was produced by proteolysis of an engineered mini-receptor precursor 
cIR485 expressed from a stable Chinese hamster ovary Lec8 cell line and purified 
by ion-exchange and size-exclusion chromatography (SEC), followed by combina- 
tion with 83-7, re-purification by SEC and finally combination with excess «CT 
peptide and insulin (or analogue). Crystallization conditions for IR310.T-derived 
complexes A, B and C were 0.9-1.1M tri-sodium citrate, 0.1 M imidazole-HCl 
(pH 8.0) + 0.02% sodium azide (NaN3). IR593.%CT was expressed from a stable 
Chinese hamster ovary Lec8 cell line followed by insulin affinity chromatography 
and then SEC of the resultant bovine insulin complex. Samples were combined 
with 83-14 and re-purified by SEC. The crystallization condition for the 
IR593.«CT-derived complex D was 9% polyethylene glycol 3350, 200 mM proline, 
100 mM HEPES-NaOH (pH 7.5). Diffraction data were collected at the Australian 
Synchrotron (beamline MX2) and Diamond Light Source (DLS; beamline 124). 
Structures were solved by molecular replacement. Introduction of Pap residues 
into the CT segment used orthogonal tRNA/amber suppression technology. 
Mutant holoreceptors were expressed transiently using co-transfected 293PEAK 
rapid cells in Pap-containing medium. Cell detergent lysates were then subjected 
to wheat-germ agglutinin chromatography; eluates containing the receptor were 
concentrated before photo-crosslinking to '*°I-[Tyr*"] insulin. Photo-crosslinked 
products were resolved by gel electrophoresis and auto-radiographed. IR468 was 
produced by transient expression in HEK293 cells. '”°I-photo-activatable insulin 
analogues were incubated with IR468 in the presence of 10° M «CT(703-719)- 
Myc and then photo-crosslinked. Samples were resolved by gel electrophoresis 
and auto-radiographed. For ITC, Ala-substituted «CT peptides (80 uM concen- 
tration) were titrated into a sample cell containing IR485 (10 1M concentration); 
and zinc-free porcine insulin (32-48 LM concentration) was titrated into a sample 
cell containing IR485 (4-6 1M concentration) pre-incubated with Ala-substituted 
aCT peptide (10X molar concentration). All samples were in Tris-buffered saline 
(pH 8.0) plus azide (TBSA). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Cloning, production and purification of IR310.T. High yield and correct fold- 
ing of IR310.T was achieved by design of a thrombin-cleavable version of IR485"° 
(termed cIR485) for use as precursor, followed by production and purification 
protocols for cIR485 similar to those described for IR485'°. A 1,741-nucleotide 
base fragment of insulin receptor was synthesized (DNA2.0) and inserted into the 
HindIII-Xbal sites of the vector pEE14 (Lonza). The fragment encoded a protein 
cIR485 (‘cleavable’ IR485) which consists, in order, of (1) the 27-residue insulin 
receptor native signal sequence; (2) residues 1-310 of the insulin receptor «-chain 
(the LI-CR module); (3) a thrombin cleavage site SSSLVPRGSSS; (4) residues 
311-485 of the insulin receptor #-chain (the L2 domain); (5) an enterokinase 
cleavage site DDDDK; and (6) a c-Myc purification tag EQKLISEEDLN”' (see 
Supplementary Fig. 7). A 106-nucleotide-base non-coding fragment 5'-GTCG 
ACGGTACCCCGGGGAATTAATTCCGGGGGCCGCCTCGGAGCATGACC 
CCCGCGGGCCAGCGCCGCGCGCTCTGATCCGAGGAGACCCCGCGCTC 
CCGCAGCC-3’ was included between the HindIII site and the region expressing 
the signal peptide; bases 29-106 of this fragment correspond to those immediately 
upstream of the human insulin receptor coding region. CHO Lec8 cells (CRL- 
1737; ATCC)” were transfected with the cIR485-containing expression vector 
using Lipofectamine 2000 (Life Technologies). Selection was undertaken in the 
presence of methionine sulphoximine (MSX), with cIR485 production monitored 
via western blot with monoclonal antibodies 83-7 and 9E10**. Production scale- 
up, performed under contract by CSIRO, was in spinner flasks (New Brunswick 
Scientific) containing DMEM/F12 medium (Life Technologies) supplemented 
with 10% fetal bovine serum (FBS; Life Technologies) plus 25 uM MSX. The 
medium was supplemented with 0.8mM butyric acid (Sigma-Aldrich) during 
the last week of the production phase. A total of 50 | of culture supernatant was 
collected; harvests were filtered (0.2 um cutoff) and then concentrated 10-fold 
(30kDa cutoff) after addition of 0.1mM_phenylmethanesulphonylfluoride 
(PMSF; Sigma-Aldrich) and 0.02% sodium azide (NaN3). Accumulated super- 
natant harvests were adjusted to pH 8 by addition of 3M Tris-HCl (pH 8.5). 
Initial purification of cIR485 was by 9E10 antibody affinity chromatography 
and size-exclusion chromatography (SEC) using procedures effectively identical 
to those described for purification of IR485’°. Purified cIR485 in Tris-buffered 
saline (24.8 mM Tris-HCl pH 8.0, 137 mM NaCl, 2.7 mM KCl) plus 0.02% NaN3 
(TBSA) was then incubated overnight at 37 °C with 0.25 units human thrombin 
(Roche) per mg of cIR485 in the presence of 10 mM CaCl. Completion of pro- 
teolysis was assessed by SDS-PAGE, which revealed bands corresponding to the 
estimated molecular mass of the IR310.T fragment and the insulin receptor L2 
domain (51 kDa and 30 kDa, respectively; Supplementary Fig. 8a). Western blot- 
ting with monoclonal antibody 83-7 confirmed that the CR domain was contained 
in the upper band alone. The sample was diluted 8-fold in buffer A (10mM 
ethanolamine-HCl, pH 9.6 + 0.02% NaNs), centrifuged for 5 min at 17,000g to 
remove particulates and then loaded onto a Mono Q 5/50 GL column (GE 
Healthcare). The sample was eluted with a 60 column-volume gradient of buffer 
A to buffer B (10 mM ethanolamine-HCl, 400 mM NaCl, pH 9.6 + 0.02% NaN3; 
Supplementary Fig. 8b) and the fractions assessed by SDS-PAGE. Fractions con- 
taining IR310.T were pooled, concentrated and re-run in TBSA on a Superdex 
200 10/300 column (GE Healthcare). The chromatogram exhibited three over- 
lapping peaks, probably arising from multimerization (Supplementary Fig. 8c). 
SDS-PAGE of fractions revealed the presence of three closely spaced bands, which 
we attributed to varying glycosylation (Supplementary Fig. 8d). The final yield of 
IR301.T was ~0.5mgml ' of cell culture. 

Large-scale production and purification of Fab 83-7. A hybridomal cell line 
expressing monoclonal antibody 83-7” was a gift from K. Siddle. Large-scale 
production and purification of monoclonal antibody 83-7 and preparation 
of Fab 83-7 from monoclonal antibody 83-7 were based on protocols des- 
cribed previously’. In particular, monoclonal antibody 83-7 was proteolysed with 
dithiothreitol-activated papain (Sigma-Aldrich) at 37 °C. Proteolysis was stopped 
by adding iodoacetamide (IAA; Sigma-Aldrich) and the reaction mixture then 
passed down a Superdex 200 26/60 column (GE Healthcare). Fractions containing 
83-7 F(ab’), were isolated and reduced with mercaptoethylamine (Sigma-Aldrich) 
and then alkylated with IAA, followed by further SEC and cation-exchange chro- 
matography on Mono S (GE Healthcare). 

Production of Fab 83-7-complexed IR310.T. IR310.T was mixed with a slight 
molar excess of Fab 83-7 and the complex purified by SEC using a Superdex 
$200 10/300 column (GE Healthcare) in TBSA buffer. Fractions containing the 
complex of IR310.T/Fab 83-7 were then pooled, concentrated and exchanged into 
10 mM HEPES-NaOH (pH 7.5) + 0.02% NaN; for crystallization screens. 
Cloning, production and purification of IR593.aCT. CHO Lec8 cells stably 
expressing IR593.«CT were obtained from CSIRO. These cells were derived by 
co-transfection with both the linearized expression plasmid pEE14 (Lonza) and 
a pEF puromycin plasmid (Springer Labs). The pEE14 expression plasmid 


contained, inserted between its HindIII-Xbal sites, a fragment encoding (in order) 
the 27-amino-acid residue insulin receptor signal sequence, insulin receptor resi- 
dues 1-593 and insulin receptor residues 704-719 (see Supplementary Fig. 9). The 
106-nucleotide-base non-coding fragment described above was also included here 
between the HindIII site and the start codon. Selection was undertaken in the 
presence of puromycin, with IR593.%0CT production monitored via europium- 
labelled insulin binding assays**. Production scale-up, performed under contract 
by CSIRO, was in spinner flasks (New Brunswick Scientific) containing DMEM/ 
F12 medium (Life Technologies) supplemented with 10% FBS plus 10 1g ml’ 
puromycin. Accumulated supernatant batches were adjusted to pH 8 by addition 
of 3 M Tris-HCl (pH 8.5). PMSF was added to 0.1 mM concentration and NaN; to 
0.02%. The culture supernatant was filtered (0.2 um cutoff), concentrated 10-fold 
(10 kDa cutoff) and pumped through a Sephacryl $100 column (GE Healthcare) 
onto an in-house manufactured bis-Boc-insulin MiniLeak medium resin column 
(Kem-En-Tec)**. The latter was eluted with bovine insulin in TBSA. PMSF was 
again added to 0.1 mM concentration and the eluates concentrated for Superdex 
$200 26/60 or S200 10/300 (GE Healthcare) SEC. The SEC profiles showed a single 
major peak corresponding to the insulin-complexed IR593.%CT homodimer anda 
further peak corresponding to excess bovine insulin (Supplementary Fig. 8g). At 
higher concentrations, an additional peak appeared corresponding to a dimer of 
IR593.0CT homodimers (Supplementary Fig. 8h). Contents of fractions were 
analysed by SDS-PAGE (data not shown). The final yield of IR593.«0CT was 
~0.4mgl ' of cell culture supernatant. 

Large-scale production and purification of Fab 83-14. A hybridomal cell line 
expressing Mab 83-14" was a gift from K. Siddle. Cells were grown in 950 cm” 
roller bottles at 37 °C in 5% CO) (balanced with air) at 10 r.p.m., using approxi- 
mately 300 ml H-SFM medium supplemented with 5% fetal calf serum (FCS). 
Typically, 2-31 of cell culture at a viable cell density of 2-3 X 10° cells ml”! was 
achieved. Cells were then pelleted (5 min, 350g) and re-suspended in serum-free 
medium. The culture was collected when cell viability dropped below 30% (after 
5-7 days). Monoclonal antibody 83-14 from collected supernatant was captured 
using a ProSep-vA column (Millipore). Monoclonal antibody 83-14 was proteo- 
lysed with dithiothreitol-activated papain (Sigma-Aldrich) at 37 °C. Proteolysis 
was stopped by adding IAA and the reaction mixture containing Fab 83-14 was 
then passed down a Superdex 200 26/60 column. Final purification of Fab 83-14 
was by cation-exchange chromatography on a MonoS column (GE Healthcare). 
Purification of the IR593.aCT + Fab 83-14 + insulin complex. The IR593.«0CT 
protein was combined with an excess of bovine insulin and Fab 83-14 and then 
re-subjected to SEC. The SEC profiles (Supplementary Fig. 8i) showed a major 
peak with a molecular mass corresponding to that of a dimer of Fab-complexed 
IR593.%0CT homodimers and a shoulder of molecular weight corresponding to that 
of an Fab-complexed IR593.«CT homodimer, as well as peaks corresponding to 
excess Fab 83-14 and insulin. Fractions corresponding to the higher molecular 
mass component of the major peak were pooled and then concentrated in a 10 mM 
HEPES-NaOH (pH7.5) + 0.02% NaN; buffer for crystallization. 

Production of IR468-containing media. A DNA fragment encoding the signal 
peptide plus residues 1-468 of human insulin receptor was inserted into the 
HindIII-Xbal sites of the vector pcDNA3.1zeo(+) (Life Technologies) and 
then transiently transfected into HEK293 cells. Cells were grown in Dulbecco’s 
modified Eagle’s medium (Life Technologies) supplemented with 10% FCS for 
72h after transfection; media containing IR468 were then collected and stored at 
—20°C until used. 

Crystallization and diffraction data collection of complexes A to D. Complexes 
A, B, C and D are as defined in Table 1. Complexes A, B and C were prepared for 
crystallization by combining the Fab 83-7 bound IR310.T (final concentration 
3.5 mg ml!) with a 1.5x molar ratio of CT peptide (Genscript) and 3X molar 
ratio of human insulin (Sigma-Aldrich) or [p-Pro®*°]-DTI-NH> in 10mM 
HEPES-NaOH (pH 7.5) + 0.02% sodium azide (NaN3). [D-Pro®°]-DTI-NH, 
was prepared as described previously”. A single crystallization condition for 
complex A was detected using a sparse-matrix 792-condition screen (CSIRO 
Collaborative Crystallization Centre) and optimized manually to 0.9-1.1 M tri- 
sodium citrate, 0.1 M imidazole-HCl (pH 8.0) + 0.02% NaN; for all complexes A, 
B and C. An initial crystallization condition for complex D at 2.0 mg ml! was 
detected in a similar 792-condition screen and optimized manually to 9% 
PEG3350, 200 mM proline, 100 mM HEPES-NaOH (pH 7.5). For data collection, 
the complex A and B crystals were cryo-protected by transfer to paraffin oil HR403 
(Hampton Research), the complex C crystal by progressive increase of the 
precipitant concentration to saturated tri-sodium citrate, and complex D crystal 
by addition of 20% glycerol to the crystallant solution. All crystals were cryo- 
cooled by plunging into liquid nitrogen and maintained at ~100K during data 
collection. Data set 1 for the complex A crystal and the data set for the complex D 
crystal were collected at beamline MX2 at the Australian Synchrotron’ 
(A = 0.95369 A). The remaining data sets were collected at beamline 124 at the 
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Diamond Light Source (A = 0.96860 A). Diffraction data were processed with 
XDS” and CCP4”. Resolution limits were assessed on the basis of the significance 
of CCj,/z at the P = 0.001 level’’. Data processing statistics are provided in Table 1. 
Structure solution and refinement of complex A. PHASER" located within the 
asymmetric unit a single copy each of the LI-CR module and the variable module 
of 83-7 (using data set 1). The 83-7 constant module was not located and ulti- 
mately presumed disordered. TLS parameters and individual restrained isotropic 
B-factors were then refined using autoBUSTER™, followed by atomic coordinate- 
only refinement, yielding Rou /R free = 0.368/0.363 (where R*P™ is the 
crystallographic R-factor reported by autoBUSTER, computed using the expecta- 
tion value of F.aic instead of the value itself)**. The difference density maps con- 
tained four helix-like features encompassing the first, third, fourth, fifth and 
seventh highest peaks in the map (the second highest peak being glycan conjugated 
to Asn 111; Supplementary Fig. 10a). These features lay adjacent to the L1-f, 
surface, and PHASER then positioned the three insulin helices as a rigid entity 
onto three of these (TFZ = 11.4). Correctness of this solution was verified by 
exhaustive six-dimensional real-space search using ESSENS“ (Supplementary 
Fig. 10b). The structure was then refined further, yielding R“\wor/R “tree = 
0.320/0.339. The remaining helix-like feature in the above difference map (the 
first and fifth highest peaks) revealed clear side-chain protrusions upon negative 
B-factor map sharpening”', spaced consistent with an underlying «-helix 
(Supplementary Fig. 10c). We concluded that it arose from the «CT peptide as 
there were no further sources of peptide in the crystallization sample. A ten- 
residue poly-alanine «-helix was then docked into this feature using ESSENS; 
the best fit was adequately discriminated from lower-scoring fits and visual inspec- 
tion confirmed the direction of the helix as judged by protruding side-chain 
density (Supplementary Fig. 10c). A procedure was then devised to assign, in an 
objective fashion, the sequence register to this 10-residue segment, based on 
compatibility of individual residue side chains both with sharpened difference 
electron density and with their surrounding protein environment. (1) Fit to dif- 
ference density used a method designed not only to assess the fit of atoms within 
the density, but also to penalize the existence of volumes of positive difference 
density into which no atoms had been placed. Briefly, voxels corresponding to the 
aCT feature were excised from the B-factor-sharpened*! map using CHIMERA* 
(cutoff level = 0.16 e~ A~*) and placed within a rectangular grid volume large 
enough to allow a >8 A buffer zone around the feature. All grid points outside of 
the feature were set to 0 e A>, with the resultant map being termed Mobs. 
Coordinates for residues 1, 2 and 10 of the fitted poly-alanine helix were then 
deleted from the model, as the density associated with them displayed poorer 
a-helical geometry. Ten alternatives, corresponding to all possible alignments of 
sub-sequences of «CT(704-719) with the heptameric poly-alanine structure, were 
then assessed. Rotamers for each residue within each of these ten models were then 
assigned manually using COOT* based on visual inspection of the fit of the trial 
rotamer to the density at the corresponding site in Mg). The ten individual models 
were then ‘real-space-refined’ within COOT to achieve optimal fit to Mo,;, main- 
taining tight helical restraints. An electron density map M,q1., on the same grid as 
Mobs was then generated (using SEALL within CCP4) for each heptamer model 
in isolation, with the B-factors of all main-chain atoms being set to 10 A? and of 
all side-chain atoms set to 20 A” to allow for subsequent comparison with a 
sharpened map. All grid points within Mcac with p<0.45e— A> were set to 
Oe A~>. Correlation coefficients CC = (<xy> — <x> <y>)I\[(<x?> — <x>?) 
(<y> -< y>’)] between Mops and each Mcaic were then calculated using 
MAPMAN™. Each CC was then ‘normalized’ by dividing its value by the CC value 
calculated for the underlying poly-alanine heptamer, and the quotient termed the 
trial sequence’s ‘density score’. The heptamer with trial register 705-711 was seen 
to have the highest density score (Supplementary Table 4). (2) Compatibility of 
residue side-chain environment within the Ll1-«CT-insulin interface was assessed 
as follows. For each of the ten trial heptamer (real-space refined) models described 
above, an ‘environment score’ was generated using VERIFY3D* to assess com- 
patibility with the surrounding L1-f, and insulin surfaces. Trial register 705-711 
again scored highest (Supplementary Table 4). A “combined score’ was then com- 
puted as the product (density score) X (environment score) to assist assessment of 
lower-ranked trial registers (Supplementary Table 4). The next highest combined 
score was for trial register 709-715, which is related to trial register 705-711 by a 
one-turn translation along the helix, effectively maintaining hydrophobic-to- 
hydrophobic docking with the L1 surface. However, register 709-715 was judged 
to be most unlikely, as it would bring Pro716 and Pro718 into the remaining 
C-terminal region of the helix. We concluded that register assignment 705-711 
was correct, given that it ranked highest on all criteria. The assignment aligned with 
that assessed directly from the EDDMs for complex D, wherein the «CT segment is 
directly attached to the C-terminus of FnIII-1 and the ambiguity of assignment 
strictly limited. Nine sugar residues were also included at N-linked sites 16, 25, 111, 
225 and 255'°”. Final refinement was against data set 2, using local structural 
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similarity restraints (LSSRs)** against Protein Data Bank (PDB) structures 2HR7- 
A (L1-CR), HL1-B (83-7 light chain) and 1FNS-H (83-7 heavy chain), yielding 
RPO or/ RP ree = 0.264/0.292 (see Supplementary Table 1). Ramachandran stat- 
istics are: favoured region 91%, acceptable region 5% and disallowed region 4%. 
Structure solution and refinement of complexes B and C. These structures were 
solved independently. MOLREP” was used to locate the L1-CR/83-7 fragment 
and the structure then refined with REFMACS5*. [p-Pro®°]-DTI-NH) (PDB entry 
2WRW) was then located using MOLREP, its position being effectively identical to 
that in the native insulin complex (complex A). The «CT segment from the wild- 
type insulin complex was fitted in the final electron density maps using COOT. 
Final refinement gave Rwork/Réree = 0.289/0.349 and Ryork/Riee = 0.289/0.335, 
respectively (Supplementary Table 1). Ramachandran statistics for both com- 
plexes are: favoured region 89%, acceptable region 7% and disallowed region 4%. 
Structure solution and refinement of complex D. Molecular replacement with 
PHASER located sequentially two copies each of a module consisting of Fab 83-14 
in complex with the FnIII-1 domain, an LI-CR module and an L2 domain, using 
search models derived from PDB entry 3LOH”. The assembly was subject to rigid- 
body and TLS refinement using autoBUSTER, followed by individual restrained 
isotropic B-factor and coordinate refinement, subject both to LSSRs to PDB entries 
2HR7-A (L1-CR-L2), 3LOH-E (Fnlll-1), 1PLG-H (83-14 heavy chain) and 
2VXU-M (83-14 light chain) and to non-crystallographic symmetry (NCS) 
restraints, yielding RP \o4/R ree = 0.308/0.299. Difference electron density 
analysis revealed: (1) two helix-like features of electron density, each connected 
to a respective insulin receptor residue Thr 593 and lying on the L1-f, surface of 
an adjacent monomer, thus identifiable as the «CT segment of IR593.0CT; and (2) 
two clusters of three helix-like features, each cluster lying adjacent to a respective 
one of the above “CT segments and putatively identifiable as insulin. Insulin was 
then docked into each of the above putative insulin-xCT clusters of features by 
means of an exhaustive six-dimensional ESSENS search. The highest scoring 
solution in each instance overlaid insulin accurately onto the respective three- 
helix cluster (Supplementary Fig. 10d) and was well discriminated from lower 
scoring solutions; the pose of insulin and %CT with respect to L1-B was effectively 
identical at the two NCS related sites. The two insulins were then included in the 
model followed by further refinement. COOT was then used to position poly- 
alanine «-helices into the respective density features corresponding to the «CT 
segment (sharpened by the application of a negative B-factor), followed by muta- 
tion of the sequence to that of xCT residues 704-715. Assignment of the «CT 
register presented no more than a single residue prima facie ambiguity (depending 
on the way in which residues at the FnIII-1-«CT peptide junction were built). One 
alternative was strongly favoured, aligning independently with that determined 
for complexes A, B and C above. Residues B5-B7 of each insulin could be built 
into electron density following a path similar to that seen in certain high-affinity 
insulin analogues'*. Further crystallographic refinement was then undertaken 
with autoBUSTER, yielding final R*“\yo4/R’PYree = 0.244/0.260 (Supplemen- 
tary Table 1). Ramachandran statistics are favoured region 88%, acceptable 
region 7% and disallowed region 5%. 

Holoreceptor photo-crosslinking. Production and photo-crosslinking of deriva- 
tized holoreceptors were performed as described previously”’. 
Photo-crosslinking of Pap-derivatized insulins to IR468. Photo-activatable 
insulin derivatives Pap*?-Ins, Pap“®-DKP-Ins, Pap®’®-DKP-Ins, Pap®**-DKP- 
Ins, [Pap-®”°]-DP-Ins were synthesized as described”**! and iodinated using the 
chloramine-T oxidation method. «CT(703-719)-Myc peptide was synthesized 
in-house. '*°I-photo-activatable insulin analogues were incubated overnight at 
4°C with IR468 and 10 °M oCT(703-719)-Myc in 100 mM HEPES (pH7.5), 
100mM NaCl, 10mM MgCh, 0.05% bovine serum albumin, 0.025% Triton 
X-100. One aliquot was exposed to short-wavelength ultraviolet light (Minera- 
light UVG-54) for 30s at 1 cm and a second aliquot treated with 0.2 mM disucci- 
nimidyl suberate (DSS; Pierce). Samples were diluted with Laemmli sample buffer, 
resolved on 10% Tricine SDS-PAGE and auto-radiographed. 

Isothermal titration calorimetry (ITC). ITC analysis of insulin binding to 
IR310.T and its Fab 83-7 complex followed protocols identical to those described 
previously”. The affinity of zinc-free human insulin for IR310.T in the presence 
of a 10X molar ratio of «CT(704-719) was determined to be Ky=30nM 
(Supplementary Fig. 8e) and the affinity of zinc-free human insulin for 
IR310.T-Fab 83-7 in the presence of a 10X molar ratio of «CT(704-719) to be 
Kg = 48 nM (Supplementary Fig. 8f). ITC analysis of mutant «CT peptides bind- 
ing to IR485 and of insulin binding to IR485 in complex with mutant «CT peptide 
were also performed as described previously”, with samples prepared in TBSA 
(25mM _ Tris-HCl, 137mM sodium chloride, 2.7mM _ potassium chloride, 
pH 8.0 + 0.02% NaN3). 
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Glaciations in response to climate variations 
preconditioned by evolving topography 


Vivi Kathrine Pedersen! & David Lundbek Egholm? 


Landscapes modified by glacial erosion show a distinct distri- 
bution of surface area with elevation’* (hypsometry). In particu- 
lar, the height of these regions is influenced by climatic gradients 
controlling the altitude where glacial and periglacial processes are 
the most active, and as a result, surface area is focused just below 
the snowline altitude’. Yet the effect of this distinct glacial hypso- 
metric signature on glacial extent and therefore on continued 
glacial erosion has not previously been examined. Here we show 
how this topographic configuration influences the climatic sensi- 
tivity of Alpine glaciers, and how the development of a glacial 
hypsometric distribution influences the intensity of glaciations 
on timescales of more than a few glacial cycles. We find that the 
relationship between variations in climate and the resulting vari- 
ation in areal extent of glaciation changes drastically with the 
degree of glacial modification in the landscape. First, in landscapes 
with novel glaciations, a nearly linear relationship between climate 
and glacial area exists. Second, in previously glaciated landscapes 
with extensive area at a similar elevation, highly nonlinear and 
rapid glacial expansions occur with minimal climate forcing, once 
the snowline reaches the hypsometric maximum. Our results also 
show that erosion associated with glaciations before the mid- 
Pleistocene transition at around 950,000 years ago probably pre- 
conditioned the landscape—producing glacial landforms and 
hypsometric maxima—such that ongoing cooling led to a signifi- 
cant change in glacial extent and erosion, resulting in more exten- 
sive glaciations and valley deepening in the late Pleistocene epoch. 
We thus provide a mechanism that explains previous observations 
from exposure dating”® and low-temperature thermochronology™ 
in the European Alps, and suggest that there is a strong topo- 
graphic control on the most recent Quaternary period glaciations. 

It has long been recognized that topographic feedbacks between 
glacial erosion and glacial mass balance influence glaciations”’?°. 
For example, previous numerical studies have shown how glacial ero- 
sion can reduce the extent of subsequent glaciations under constant 
climate conditions by lowering of topography’*"*. However, the effect 
of a distinct glacial landscape hypsometry on glacial extent through 
several glacial cycles has not been considered explicitly, and limited 
work only has focused on transient effects in glacial extent resulting 
from glacial erosion processes modifying the topography’*"*°. 

Here we examine the response of mountain-range glaciations to 
the present topographic distribution for a varying climate using a 
numerical surface process model, including an ice-sheet model suit- 
able for rugged mountain topography~*** (see Methods Summary and 
Supplementary Information). Furthermore, in an additional experi- 
ment, we impose a climate forcing comparable with Quaternary climate 
records”, to investigate possible feedbacks between climate, topography, 
glacial extent and glacial erosion that may have prevailed throughout the 
last two million years (Myr) of Earth’s history. 

The hypsometry of a landscape influences the mass balance of 
glaciers, because it dictates how much surface area is available for snow 
and ice accumulation when paired with a temperature-elevation dis- 
tribution. Therefore, for a given variation in snowline altitude, the 


change in glacier accumulation area is a nonlinear function of the 
topographic configuration. This nonlinearity becomes increasingly 
pronounced through the development of the distinct hypsometric 
maximum found worldwide below the present snowline altitude for 
glacially modified areas** (see Supplementary Figs 1-5). 

To illustrate this, we use our numerical approach for simulating 
glaciation of a catchment from Sierra Nevada, Spain, where limited 
glacial activity has occurred throughout the Quaternary”*, and of a 
catchment from the Bitterroot Range, USA, which has been signifi- 
cantly modified by glaciers during the Quaternary”® (see Methods 
Summary, Supplementary Information, and Supplementary Videos 
1 and 2). Modelled glaciers develop as a result of steady cooling over 
a period of 50 thousand years (kyr) followed by a 50-kyr steady tem- 
perature increase. The snowline altitude, which in our model is a linear 
function of surface temperature, is for both catchments lowered from 
2,800 m down to 1,900 m (Fig. la, b). This snowline span of 900m 
corresponds roughly to the difference between present-day snow- 
line altitudes and suggested Last Glacial Maximum (LGM) snowline 
altitudes’. Importantly, although it is representative of the LGM 
snowline variation in the Bitterroot Range, the snowline interval is 
not meant to represent the actual LGM snowline lowering for Sierra 
Nevada, where the real snowline altitude reached only the highest parts 
of the mountain range during the Quaternary”. Instead, the experi- 
ment is designed to highlight the isolated effect of catchment hypso- 
metry on glacial extent by keeping everything else equal. For the same 
reason, no tectonic uplift or erosion is introduced. 

For the fluvial catchment in Sierra Nevada, the ice volume increases 
with a slightly accelerating rate during snowline lowering as the 
amount of surface area increases downwards (Fig. 1c, d, Supplemen- 
tary Video 1). However, for the glacially modified catchment in 
Bitterroot Range, the relationship between snowline lowering and 
glacial extent is highly nonlinear. The ice volume increases drastically 
when the snowline altitude reaches the hypsometric maximum 
(Fig. 1c, Supplementary Video 2). However, as the snowline altitude 
subsequently falls below the hypsometric maximum, the rate of change 
in ice volume decreases to values lower than for the Sierra Nevada 
catchment (Fig. 1d). So, for a similar climatic forcing, the resulting ice 
volume (that is, areal extent) is very sensitive to the topographic dis- 
tribution in the two catchments. The nonlinearity between climate 
change and glacial extent found for glacially modified landscapes is 
activated when the snowline altitude varies in proximity to the distinct 
glacial hypsometric maximum, which would be the case for a climate 
cooling from the present-day level. 

We then examine the transient signal in glacial extent and glacial 
erosion for a climate forcing similar to what is suggested to have 
prevailed throughout the last 2 Myr of the Quaternary™ by introducing 
glaciers and glacial erosion in a simulated fluvial steady-state landscape 
(Fig. 2a, f, and Supplementary Videos 3 and 4). We reproduce the first- 
order patterns of the Quaternary climate variations by introducing, 
first, a phase of symmetric constant-magnitude 40-kyr climate cycles 
(phase 1), followed by a series of asymmetric 100-kyr climate cycles 
with increasing amplitude (phase 2), leading to an overall decrease in 
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Figure 1 | Ice volume as a function of climate change and hypsometry. 

a, b, Hypsometric distributions for the glacially modified Bear Creek catchment 
in the Bitterroot Range, Montana, USA (46.4° N, 114.4° W), and the fluvial 
Aldeire catchment from Sierra Nevada, Spain (37.1° N, 3.1° W). The 
topography has been normalized to the maximum height above the local base 
level. Black dashed lines represent the minimum level of the snowline altitude at 
50 kyr. For a map view see Supplementary Figs 11 and 12. c, Temporal 


mean temperature, before a final phase of asymmetric constant- 
magnitude 100-kyr climate cycles (phase 3, Fig. 2a). To expose the 
effect of glacial erosion on the areal extent of glaciations, our experi- 
ment includes only glacial erosion processes and the resulting flexural 
isostatic responses of both loading and unloading due to ice and 
erosion. The change in ice volume (that is, glacial extent) is shown 
relative to a reference simulation without glacial erosion (Fig. 2b-d). 

During the first phase of constant-magnitude climate cycles (Fig. 2a), 
landforms that are typically associated with a glacial origin emerge. The 
numerical model includes higher-order ice physics, which facilitates a 
detailed high-resolution simulation of landscape evolution on scales 
small enough to resolve U-shaped valleys, hanging valleys, cirques 
and steep headwalls (Fig. 2h, Supplementary Fig. 14). Furthermore, a 
hypsometric maximum develops just below the minimum imposed 
snowline altitude, similar to global observations (Fig. 2e, Supplemen- 
tary Fig. 1). Concurrently with the development of these topographic 
features, the ice volume, and therefore glacial erosion, decreases 
throughout the occurring glaciations as a consequence of glacial 
erosion, irrespective of climate being constant for each cycle (Figs 2b 
and 3a). Erosion is, to a high degree, focused in the steep regions above 
the lowest snowline altitude, where glacial conditions prevail even 
during the warmest periods of each climate cycle (Fig. 3b). Glacial 
erosion below the lowest snowline altitude is limited to major valleys, 
and glaciers reach far below this level only during the coldest periods 
of the first climate cycles. Thus, glacial erosion lowers mainly the 
topography at high elevations, reducing the glacier accumulation 
space and initiating a negative feedback between glacial erosion and 
glacial extent”. 

During the second phase (Fig. 2a) mimicking the mid-Pleistocene 
transition from 40-kyr to 100-kyr climate cycles, the ice volume 
increases as a consequence of cooling. More interestingly, the change 
in ice volume is greater for the glacially modified landscape than for the 
reference model, and the model with glacial erosion reaches even 
higher ice volumes than the model without glacial erosion (Fig. 2b, d). 
In other words, the glaciated cover of the glacially eroded landscape is 
more sensitive to climatic variations because a lot of surface area is 
concentrated within a narrow elevation span in the same way as is seen 
for the Bitterroot catchment compared to the fluvial Sierra Nevada 
(Fig. 1). Another outcome of focusing surface area within a narrow 
elevation interval is that the glaciations following glacial erosion 
initiate later than for the reference model without glacial erosion, 
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evolution in ice volume for the catchments in Bitterroot Range (solid black line) 
and in Sierra Nevada (dashed black line) for a changing snowline altitude (solid 
grey line represents the Bitterroot Range and dashed grey line the Sierra 
Nevada). d, Temporal evolution of the rate of ice volume change for the two 
landscapes. Animations (Supplementary Video 1 and 2) are available online, 
showing simulated glaciations in the two catchments. 


suggesting shorter but more vigorous glaciations in response to the 
same climatic forcing (Fig. 2b, d). During the constant-magnitude 
climate cycles in the third phase, the negative feedback between glacial 
erosion and glacial extent returns, although it is not as pronounced as 
in phase 1. This is because erosion in phase 1 has already modified the 
longitudinal and transverse shapes of valleys at high elevations, and 
these landforms are now associated with smaller subglacial erosion 
rates than their fluvial counterparts (Supplementary Figs 7 and 15). 

As a consequence of the strong negative feedback between glacial 
erosion, glacial mass balance, and ice extent in phase 1, only limited 
glacial erosion occurs in the main valley during the constant- 
magnitude glacial cycles of this phase (Fig. 3a, b). The negative feed- 
back is reflected by the decreasing glacial erosion rate as well as the 
shorter glacial cycles (Fig. 3a). However, as general cooling is intro- 
duced in phase 2, the negative feedback is broken because snowline 
lowering reincorporates the previously lowered topography into the 
area available for ice accumulation. This leads to a significant jump in 
glacial extent across the transition from the ice-poor conditions at 
the end of phase 1 to the large glaciations in phases 2 and 3 (Fig. 2c), 
enhancing the potential for glacial erosion in the main valleys (Fig. 3c). 
The model experiments thus reveal a greater effect of cooling across the 
mid-Pleistocene transition when the landscape has been prepared by 
earlier glaciations (Fig. 2c and Supplementary Fig. 28). That the model 
including glacial erosion has even more ice in phases 2 and 3 than the 
model without erosion is caused by flexural isostatic rock uplift, adding 
surface area to the hypsometric maximum from below, in combination 
with glacially flattened longitudinal valley profiles (Supplementary 
Figs 15, 17 and 19), which require thicker ice to maintain ice flux from 
the accumulation zone to the ablation area. 

The results presented here thus suggest that topographic modifica- 
tions occurring throughout the Quaternary may have led to a change 
in glaciations for the following cycles, by promoting an increasingly 
nonlinear relation between climate change and the area available 
for snow and ice accumulation. In particular, as the climate cooled 
generally throughout the Quaternary, the more extensive glaciations 
found in the final part of the Quaternary and the contrast in glaciations 
and glacial erosion across the mid-Pleistocene transition may not only 
be the direct results of colder temperatures, but also the responses to 
the developing glacial topographic signature in combination with 
snowline lowering. The significant increase in simulated glacial extent 
and glacial erosion across the mid-Pleistocene transition (Figs 2c 


10 JANUARY 2013 | VOL 493 | NATURE | 207 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


an 
a 
ea 
2 


Snowline 
altitude (km) 


05k Phase 1 Phase 2 
60+ b : 
— With erosion 
= 40 No erosion 
o 
£ 20 
0 
Time (Myr) 
f 
h 
j 


Figure 2 | Transient changes in ice volume for a Quaternary-like climate 
forcing. a, Temporal evolution in snowline altitude. b, Temporal evolution in 
spatially averaged ice volume for models with (red) and without (blue) glacial 
erosion. For our choice of glacial erosion rate, see experiment S1 in the 
Supplementary Information. c, Enlarged view of the change in ice volume on 
the transition from phase 1 to phase 2 for the models with (red) and without 
(blue) glacial erosion, respectively (Box A). d, Enlarged view of the glacial cycle 
in Box B in b). e, Hypsometric distribution for the initial fluvial landscape 
(grey), the landscape after phase 1 (red), and the final landscape after phase 3 


and 3, and Supplementary Fig. 28) can explain results found using 
exposure dating’? and low-temperature thermochronology" in the 
European Alps, suggesting an increase in relief around a million years 
ago owing to extensive valley deepening and only minor erosion at 
higher elevations in association with extensive glaciations following 
the mid-Pleistocene transition. 

In another new study comparing offshore sediment volumes with 
onshore estimates of fjord erosion in Norway~’, it has been suggested 
that extensive erosion occurred both in fjords and at higher eleva- 
tions in the Quaternary. This study may therefore record both an 
initial phase of high-elevation erosion and a subsequent extensive 
deepening of valleys. By suggesting two different timings of main 
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(blue). The dashed black lines represent the minimum snowline altitude in 
phases 1 and 3, respectively. f, Initial fluvial steady-state landscape. Contour 
spacing is 25m. The 100-m contour has been highlighted for reference. 

g, Glacial extent in the first glacial cycle at 20 kyr. h, Landscape configuration 
after phase 1. i, Glacial extent in the last glacial cycle of phase 1 at 980 kyr. 

j, Final landscape after phase 3. Note the uplifted foreland caused by flexural 
isostatic effects. k, Glacial extent in the final glacial cycle at 1,980 kyr. Details of 
modelling parameters, full size figures, and animations are available in the 
Supplementary Information. 


cirque formation and main valley deepening, our results therefore 
reconcile both the hypothesis of bimodal Quaternary glacial erosion’? 
and the studies showing an increased relief across the mid-Pleistocene 
transition’®". 

Climate change is the main driver for glaciations. However, as we 
have demonstrated here, the topographic distribution that emerges as 
a result of glacial erosion processes in alpine settings influences 
the relation between climate and glacial extent in a highly nonlinear 
manner by changing the area available for snow and ice accumulation. 
The fact that areas modified recently by glacial erosion processes 
generally have a distinct hypsometric distribution, with a maximum 
just below the present day snowline altitude (Supplementary Fig. 1), 
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Figure 3 | Variations in glacial erosion for a Quaternary-like climate. 

a, Temporal variation in spatially averaged glacial erosion rate (black line) and 
average glacial erosion for each glacial cycle (blue circles). b, Average glacial 
erosion rate in phase 1. c, Average glacial erosion rate in phases 2 and 3. Spatial 


suggest that these nonlinear effects may have affected most regions 
occupied by alpine-type glaciations throughout the Quaternary, and 
that recent fluctuations in areal extent of glaciations may have been 
significantly affected by a constantly developing topographic control 
on glacier mass balance. 


METHODS SUMMARY 


The numerical surface process model (SPM) we used couples the flow of water, ice 
and sediment. The flow of ice is handled using a higher-order ice-sheet model, 
taking into account effects from steep gradients in the bed and ice surfaces, and 
internal stresses in the ice”’”* (integrated Second Order Shallow Ice Approxi- 
mation; iSOSIA). These effects are of importance when modelling ice flow in alpine 
settings, and their implementation facilitates detailed predictions of long-term 
landscape evolution for direct comparison with present-day landforms. On the 
basis of the flow components, fluvial erosion, glacial erosion (abrasion/quarrying), 
and mass-wasting processes can be estimated using a set of erosion laws”’. Tectonic 
uplift and flexural isostatic effects related to loading/unloading of ice and erosion 
are included. The combined model set-up has been described elsewhere”, 
including testing of the iSOSIA approach against other higher-order ice models 
as well as full-Stokes solutions for ice flow’', however, a summary of the set-up and 
chosen parameter values can be found in the Supplementary Information. 

For the glaciation experiments of real catchments from Sierra Nevada (Spain) 
and the Bitterroot Range (USA), extracts from the SRTM3” data set were inter- 
polated onto a SPM grid with similar resolution. The experiment illustrating 
Quaternary evolution (Figs 2 and 3) initiates from a synthetic fluvial steady state, 
where rock column uplift is balanced by erosion caused by fluvial and hillslope 
processes. This state is reached by uplifting the model domain (20 km X 20 km) ata 
constant uniform rate, while fixing the boundary nodes at sea level. The actual 
experiment excludes tectonic uplift and erosion by fluvial and hillslope processes, 
except that a local threshold criterion, representing local short-term hillslope pro- 
cesses, prevents the emergence of unrealistic steep surface slopes. Lowland is added 
to the edge of the model domain in order for ice to extend from the topography. 
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Craniofacial development of hagfishes 
and the evolution of vertebrates 


Yasuhiro Oisi’?, Kinya G. Ota®, Shigehiro Kuraku‘*, Satoko Fujimoto? & Shigeru Kuratani? 


Cyclostomes, the living jawless vertebrates including hagfishes and lampreys, represent the most basal lineage of 
vertebrates. Although the monophyly of cyclostomes has been supported by recent molecular analyses, the 
phenotypic traits of hagfishes, especially the lack of some vertebrate-defining features and the reported endodermal 
origin of the adenohypophysis, have been interpreted as hagfishes exhibiting a more ancestral state than those of all 
other vertebrates. Furthermore, the adult anatomy of hagfishes cannot be compared easily with that of lampreys. Here 
we describe the craniofacial development of a series of staged hagfish embryos, which shows that their adenohypophysis 
arises ectodermally, consistent with the molecular phylogenetic data. This finding also allowed us to identify a 
pan-cyclostome pattern, one not shared by jawed vertebrates. Comparative analyses indicated that many of the 
hagfish-specific traits can be explained by changes secondarily introduced into the hagfish lineage. We also propose a 
possibility that the pan-cyclostome pattern may reflect the ancestral programme for the craniofacial development of all 


living vertebrates. 


Recent molecular phylogenetic data revealed that extant vertebrates 
consist of two monophyletic groups: the living jawless vertebrates 
(cyclostomes) and jawed vertebrates’ > (Fig. 1). Most extant vertebrates 
belong to the gnathostomes, which are characterized by the possession 
of a jaw and two nostrils® (Fig. 1i, j). In contrast, cyclostomes, com- 
prising only lampreys and hagfishes, lack a biting jaw and possess only 
a single nostril® (Fig. la-g). Morphologically, however, several traits in 
hagfishes have been interpreted as representing more ancestral states 
than found in all other vertebrates, including lampreys””. 


Hagfish 
a 


Lampray 
d 


Jawed vertebrates 


Figure 1 | Cyclostomes and gnathostomes. a-c, Eptatretus burgeri (hagfish). 
a, Dorsal view. b, Ventral view of a hagfish head showing a single nostril (en) 
and jawless mouth (mo). ¢, A diagram of the head anatomy. d-g, Lethenteron 
japonicum (lamprey). d, Lateral view. e, Ventral view to show the oral funnel 
surrounding the mouth. f, Dorsal view to show the dorsal nostril. g, A diagram 


In particular, the craniofacial pattern of hagfishes differs fun- 
damentally from that of lampreys, making it difficult to establish 
morphological homologies between the two cyclostomes**. Typi- 
cally, the nasohypophyseal duct ends blindly in lampreys, whereas 
that in hagfishes continues into the pharynx, exhibiting a conspicuous 
anatomical difference’? (Fig. 1c, g and Supplementary Fig. 1). Another 
impediment to the comparison is linked to whether the hagfish ade- 
nohypophysis is derived from the endoderm, similar to its potential 
homologue of amphioxus’’°'*”, or the ectoderm”’, like that of 


of the head anatomy. h, i, Scyliorhinus torazame (shark). h, Lateral view. 

i, Frontal view to show upper and lower jaws (uj and ]j), and paired nostrils (en). 
j, Diagram of the generalized gnathostome head. ah, adenohypophysis; et, 
ethmoidal region; Ilp, lower lip; ne, nasal epithelium; ons, oronasohypophyseal 
septum; T1-4, tentacles 1-4; ulp, upper lip. 


Department of Biology, Graduate School of Science, Kobe University, Kobe 657-8501, Japan. *Laboratory for Evolutionary Morphology, RIKEN Center for Developmental Biology, Kobe 650-0047, Japan. 
3Marine Research Station, Institute of Cellular and Organismic Biology, Academia Sinica, Yilan 26242, Taiwan. “Genome Resource and Analysis Unit, RIKEN Center for Developmental Biology, Kobe 650- 


0047, Japan. 


10 JANUARY 2013 | VOL 493 | NATURE | 175 


©2013 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


lampreys and gnathostomes. The endodermal origin of the hagfish 
adenohypophysis was suggested by limited embryological data based 
ona few embryos that were fixed in a suboptimal manner’'°. Notably, 
the topographical relationship between the adenohypophysis and the 
oropharyngeal membrane has remained inconclusive”. 

The discrepancy between the morphology and molecular phyloge- 
netics may also be attributable to as-yet-undescribed developmental 
differences between these agnathans. Therefore, understanding the 
developmental embryonic patterns in hagfishes and lampreys is cri- 
tical for comparing the morphology between the two cyclostome 
groups and characterizing the body plan of hypothetical cyclostome 
ancestors, thus leading to a better understanding of the earliest evolu- 
tionary history of vertebrates. In this study, we incubated time-scaled 
fertilized eggs of the hagfish Eptatretus burgeri and collected serial 
staged embryos to re-examine the development and germ-layer origin 
of the adenohypophysis and craniofacial structures (Supplementary 
Fig. 2 and Supplementary Table 1). 

In lampreys and gnathostomes, the adenohypophysis and oronasal 
region are identified as ectodermal derivatives, arising between the 
stomodeal ectoderm and pharyngeal endoderm”. Similar structures 
also exist in early hagfish embryos, and the position of the oropharyn- 
geal membrane becomes critical for their germ-layer origins”’® (Fig. 2). 


Craniofacial development of E. burgeri 

During the mid-neurula (stage 25) of E. burgeri, the presumptive 
oropharyngeal membrane is anterior to the forebrain primordium 
(Fig. 2a, b), where the ectoderm and endoderm are juxtaposed with 
each other (Fig. 2b and Supplementary Fig. 3a). Depression of the 
stomodeum is enhanced with the growth of the forebrain by the late 


neurula (stage 28), creating the presumptive oronasohypophyseal 
cavity anterior to the oropharyngeal membrane (Fig. 2c, d and 
Supplementary Fig. 3). However, the oronasohypophyseal cavity is 
secondarily closed from the exterior by a pair of ectodermal folds 
arising on both sides of the forebrain, which grow rostromedially to 
fuse with its counterpart (arrows in Fig. 2c, e and Supplementary Fig. 
4a, b). This fusion finally forms a hagfish-specific membrane, the 
secondary oropharyngeal membrane (SOM)"* (Fig. 2f-i). The previ- 
ously reported endodermal origin of the hagfish oronasohypophyseal 
cavity was based on misidentification of the SOM as the oropharyn- 
geal membrane”® (Supplementary Fig. 4c); the hagfish oronasohypo- 
physeal cavity is located rostral to the true oropharyngeal membrane, 
which arises in a more caudal position (Fig. 2i-l). On the basis of the 
topographical relationship, the hagfish oronasohypophyseal cavity, 
including the adenohypophysis (light-blue in three-dimensional 
images, Fig. 2i-), is likely to be of ectodermal origin. 

To exclude the previously reported possibility that the endo- 
dermal pouch beneath the hypothalamus differentiates into the ade- 
nohypophysis in hagfishes’, we observed the development of the 
hypothalamus-associated structures caudal to the true oropharyngeal 
membrane. In stage 25 E. burgeri embryos, a thickening of the rostral- 
most part of the pharyngeal endoderm abuts the hypothalamus rostral 
to the notochord (pep (prechordal plate) in Fig. 2b), and at stage 26, 
this endodermal portion expresses EbHh1 and EbSix3/6A (pcp in 
Fig. 3a, c). On the basis of the topographical and molecular signatures, 
this thickening probably corresponds to the prechordal plate of other 
vertebrate embryos'”"” (Fig. 2i, j and Supplementary Fig. 5). Between 
stages 35 and 40, the prechordal plate forms a dorsally protruding 
pouch, reminiscent of Rathke’s pouch in gnathostomes (Supplementary 


Figure 2 | Craniofacial development of E. burgeri. a-f, Frontal (a, c, e) and 
lateral (b, d, f) views. The ectoderm is coloured light blue, the endoderm is 

coloured yellow, and the mesoendoderm is coloured orange. Arrows indicate 
the SOM. g, h, Sagittal sections at the plane represented in e. i-p, Lateral views 
(i, k, m, o) and sagittal sections (j, 1, n, p). Inset in p shows the differentiated 
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adenohypophysis (ah). e, eye; ht, hypothalamus; nhd, nasohypophyseal duct; 
nhp, nasohypophyseal plate; nt, notochord; oc, oral cavity; onc, 
oronasohypophyseal cavity; opm, oropharyngeal membrane; pcp, prechordal 
plate; p1, pharyngeal pouch 1; st, stomodeum. See Fig. 1 for other abbreviations. 
Scale bars, 100 um. Numbers indicate developmental stages. 


©2013 Macmillan Publishers Limited. All rights reserved 


Fig. 5a—c), adhering to the posterior aspect of the hypothalamus (Fig. 2i, j 
and Supplementary Fig. 5d—g). This pouch, however, becomes detached 
from the hypothalamus by stage 45 and disappears by stage 50 without 
differentiating into any overt structures, with the exception of a pair of 
lateral chords of cells growing from the pouch (Fig. 2k-n and Sup- 
plementary Fig. 5d-i). These chords are reminiscent of the premandi- 
bular mesoderm in embryos of other vertebrates’’; however, they also 
disappear before stage 45 (Fig. 2i, k). Thus, the rostral endoderm in 
hagfishes does not participate in the formation of the adenohypophysis. 

The ectodermal origin of the hagfish adenohypophysis is consistent 
with the gene expression patterns. In lampreys and gnathostomes, the 
molecular signature of the hypothalamic anlage, to which the adeno- 
hypophysis is adhered ventrally, is seen as a domain of Shh and Nkx2. 1 
expression posterior to the Six3/6-positive optic chiasma'*”’. The 
hypophyseal placode is defined by the expression of Pitx1/2 and 
Lhx3/4 (refs 20, 21). Similarly, in hagfish embryos, EbHh1 and 
EbNkx2.1 (putative orthologues of Shh and Nkx2.1) are expressed 
in the presumptive hypothalamus (Fig. 3a, b and Supplementary 
Fig. 6a, b), caudal to the optic chiasma that expresses EbSix3/6A, 
the putative hagfish orthologue of Six3/6 (Fig. 3c, d and Supplemen- 
tary Fig. 6c). Notably, the expression of EbPitxA and weak expression 
of EbLhx3/4A (putative Pitx2 and Lhx3/4 orthologues in hagfishes) 
were detected in the ectodermal thickening rostral to the oropharyn- 
geal membrane beneath the hypothalamus (Fig. 3e and Supplemen- 
tary Fig. 6d—f), which is topographically comparable to the hypophyseal 
placode of other vertebrates. Thus, the hagfish adenohypophysis is 
ectodermally derived, apparently based on gene regulations identical 
to those in gnathostomes and lampreys. 


Pan-cyclostome pattern of embryogenesis 

Unlike in gnathostomes, the hypophyseal and nasal placodes of the 
lamprey embryo form a single median complex, the nasohypophyseal 
plate (NHP)*” (Supplementary Fig. 7). In hagfish embryos, the puta- 
tive NHP is an ectodermal thickening beneath the forebrain (nhp in 


ie 


Figure 3 | Embryonic gene expression in Eptatretus burgeri. a-f, Expression 
of genes involved in the NHP development from stages 26 to 40. g, Dorsal view 
of three-dimensional-reconstructed image. The blue area shows pan-placodal 
domain. h, Schematic diagrams summarizing the gene expression in the 
embryos. In the graph, the hypothalamic domain is characterized by EbHh1, 
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Fig. 2d), which is itself bilaterally encircled by a horseshoe-shaped 
band of thickened ectoderm, reminiscent of the pan-placodal domain 
in other vertebrates’* (Fig. 3g). It was possible to follow the latter band 
caudally to the otic placode, and it also encompassed the putative 
precursors of cranial sensory ganglia”’ (Fig. 3g). 

The hypophyseal placode was identified in the posterior part of the 
NHP, whereas its anterior part, or the nasal placode homologue, 
expressed EbFgf8/17, a putative hagfish orthologue of Fef8 (Fig. 3f), 
which is expressed in the nasal placode in gnathostomes and 
the anterior NHP in lampreys*’**. Furthermore, EbSix3/6A and 
EbSoxB1 were expressed throughout (both anterior and posterior) 
the NHP of hagfish (Fig. 3c, d and Supplementary Fig. 6c, g, h), 
consistent with the expression patterns of Six3/6 and Sox2/3 in both 
the nasal and hypophyseal placodes in gnathostomes'*'*. The NHP 
takes the form of a single median placode through development 
(Fig. 3h), and finally differentiates into the adenohypophysis and 
unpaired nasal epithelium in a single nasohypophyseal duct, as 
in lampreys (Fig. 20, p). Thus, the NHP seems to be homologous 
between hagfishes and lampreys. 

The homology of the NHP between hagfish and lampreys implies 
that the overall craniofacial organization of cyclostomes would also be 
conserved. At mid-pharyngular stages of cyclostomes, the NHP is 
rostrocaudally bordered by ventral growths of ectomesenchymal pro- 
cesses, with the anterior nasal process rostral to the NHP and the post- 
hypophyseal process between the NHP and prechordal plate” 
(Fig. 4a, e and Supplementary Fig. 7). Both in hagfishes and lampreys, 
the post-hypophyseal process initially arises as a pair of lateral 
processes at the opening of the oronasohypophyseal cavity (Fig. 4b, 
c, f, g), which coalesce at the midline to form a shelf behind the NHP, 
separating the nasohypophyseal duct and oral cavity (Fig. 4d, h). In 
the late pharyngula of hagfishes, the rostrolateral portion of the post- 
hypophyseal process divides into four pairs of tentacles (Fig. 4d), 
whereas the equivalent process of lampreys forms a dorsal oral hood, 
or the upper lip of the ammocoete larva (Fig. 4h). The peripheral 
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EbNkx2.1 and EbSix3/6 expression. Adenohypophysis-like expression (blue 
bars) is restricted in the ectodermal domain rostral to and including the 
oropharyngeal membrane (opm). ch, chiasma; fb, forebrain; hb, hindbrain; mb, 
midbrain; ot, otic placode. See Figs 1 and 2 for other abbreviations. Scale bars, 
100 um. 
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distribution patterns of the trigeminal nerves are consistent with the 
above comparison, indicating that a subset of hagfish tentacles (T1, 
T3, T4) are comparable with the lateral wall of the lamprey upper lip 
and T2 is comparable with the dorsal medial roof of the upper lip 
(Supplementary Fig. 8). The oral roof beneath the nasohypophyseal 
duct is homologous between hagfishes and lampreys””’ (Fig. 41), but 
it is not comparable to the pattern of jawed vertebrates (Fig. 4i-l and 
Supplementary Fig. 9). 

During the developmental stages when lamprey and hagfish 
embryos show the most conspicuous resemblance to each other in 
the oronasal region, other craniofacial portions of the embryos also 
show similarities, most of which can be recognized as vertebrate- 
specific features*’. For example, EbPax1/9 was expressed in a series 
of pouches on the lateral side of the pharynx (Supplementary Figs 10a 
and 11a), whereas EbTbx1/10A was expressed in the mesenchymal 
core in each space between two successive developing pouches (Sup- 
plementary Figs 10b and 11a), consistent with the expression patterns 
of Pax1/9 and Tbx1/10 in pharyngeal pouches and the mesodermal 
core of pharyngeal arches in other vertebrate embryos, including 
those of lampreys**”’. We also noted post-otically developing somites 
and typical rhombomeric segmental patterns on the hindbrain of 
hagfishes, as well as the formation of cranial nerve roots on even- 
numbered rhombomeres (Supplementary Fig. 12a, b). In both of the 
cyclostomes, the first pharyngeal mesoderm (mandibular mesoderm) 
and pouches form the muscle anlage of the post-hypophyseal process/ 
velum and velar chamber’’ (Supplementary Fig. 1la—d, f-m). By the 
late pharyngula, the second pharyngeal pouch of lampreys forms the 
rostral-most gill pore (Supplementary Fig. 11m), whereas that of 
hagfishes is derived from pouch 4 because pouches 2 and 3 are dege- 
nerated secondarily in hagfishes (Supplementary Fig. 11h-j). Never- 
theless, the pharyngeal mesoderm belonging to arches 2-4 remain in 


Figure 4 | Comparison of vertebrate heads. a—h, Mid-sagittal sections 

(a, e) and ventral views (b-d, f-h) of Eptatretus burgeri (a, stage 45; b, stage 40; 
c, stage 50; d, stage 53; SOM has been removed from b-d) and Lethenteron 
japonicum (e, stage 26; f, stage 23; g, stage 26; h, stage 27). i-k, Ventral views of 
Scyliorhinus torazame (i, stage 20; j, stage 25; k, stage 28). The premandibular 
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the original site, innervated by the equivalent cranial nerves (Sup- 
plementary Fig. 12c-g). Thus, the embryonic morphology of the 
pharyngeal structures in hagfishes also reflects the basic segmental 
pattern of pharyngeal arches generally found in vertebrates. In the late 
pharyngula of hagfishes, however, secondary modification of cranio- 
facial structures results in the anatomical patterns unique to hagfishes. 

In the late hagfish embryo, the oral cavity and nasohypophyseal 
duct grow caudally relative to the position of the adenohypophysis 
(blue and black arrows, Supplementary Fig. 11f-j), and the pharynx 
elongates posteriorly*' (white arrow, Supplementary Fig. 1la-j). 
During these processes, the presumptive nasohypophyseal duct, 
velum and velar chamber recline inwards to take an oblique orienta- 
tion unlike that of lamprey, which is positioned more vertically 
(Fig. 2i, k, m, o and Supplementary Fig. 11f-j). The gill apparatus also 
shifts caudally to the mid-trunk level (white arrow and sm (somite) in 
Supplementary Fig. 11a-e). In addition, the posterior end of the naso- 
hypophyseal duct opens into the pharynx by the rupture of the caudal 
root of the post-hypophyseal process-derived oronasohypophyseal 
septum (compare Fig. 2m, n to Fig. 20, p). Thus, unlike the nasohy- 
pophyseal duct of lampreys, which ends blindly, the duct in hagfishes 
opens secondarily into the pharynx. These hagfish-specific morpho- 
logical modifications are what differentiate adult hagfishes and lam- 
preys at the anatomical level (Supplementary Fig. 1). Alternatively, 
the overall embryonic features indicate that the larval morphology in 
lampreys is at a relatively less differentiated state compared with that 
of hagfishes. 


Discussion 

Our re-interpretation of the ectodermal adenohypophysis (Fig. 2) 
refutes the previous hypothesis that the craniofacial structures of 
hagfishes represent a more ancestral state than those of all other 


E. burgeri 


S. torazame 


domain (asterisk) is comparable to the post-hypophyseal process (php) of 
cyclostomes. The blue domain (lateral and medial nasal prominences 
(fnp+Inp)) resembles the anterior nasal process (anp) of cyclostomes (ref. 22). 
1, Generalized patterns of vertebrate heads. ma, mandibular arch; np, nasal 
placodes. See Figs 1 and 2 for other abbreviations. Scale bars, 100 tm. 
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vertebrates. This hypothesis, together with the concept of the “endoder- 
malized head’ of hagfish’*, was already called into the question by 
reports that the hagfish adenohypophysis is structurally and function- 
ally similar to that of other vertebrates**”’. Also, many of the vertebrate- 
defining characters once believed to be absent from hagfishes, including 
lens, spleen, vertebrae and a lateral line system, are now known to deve- 
lop transiently or in primitive state in hagfish adults and embryos*** °°. 
In addition, there are even some cyclostome-specific synapomorphies 
at the morphological level as well, including the rostrally migrating 
myotomes, velum and tongue apparatus’, that support the mono- 
phyly of cyclostomes (Fig. 5). Thus, the morphological status of hag- 
fishes is reasonably consistent with the molecular phylogenetic data’. 

Hagfishes and lampreys share embryonic craniofacial features such 
as the NHP and two processes (and their derivatives) anterior and 
posterior to the NHP (Fig. 5). This pattern never appears during any 
developmental stage of living gnathostomes. The adenohypophysis 
and nasal organ are adjacent to each other anteroposteriorly in the 
cyclostomes” (Fig. 5). In contrast, the adenohypophysis becomes 
separated from the paired nostril and is secondarily incorporated into 
the oral ectoderm in gnathostomes (Fig. 5). The possible single naso- 
hypophyseal opening is shared with the basal ‘ostracoderms’ (stem 
gnathostomes), the anaspids®’*”*, such that the NHP can be inter- 
preted as an attribute of the monorhiny of the basal agnathans and 
thus the ancestral state for vertebrates (Fig. 5). 


Hagfish 


Caudal shift of posterior pharynx 
Formation of SOM 

Caudally open nasohypophyseal duct 
Secondary loss of lateral lines, lens 
etc 


Mandibular arch-derived tongue 
Preotically shift of rostral myotomes? 
Velum? 

Cyclostomes 


en ne ah Lamprey 


al 


mo ba lip 
aNP php php 
Pan-cyclostome 
(= pan-vertebrate?) 
pattern 


Other stem gnathostomes 


Gnathostomes Galeaspids 


Separation of nasal —— 
+ hypophyseal placodes 
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Adenohypophysis 
opening in the oral cavity 
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Crown gnathostomes 


Paired nostrils 
Intertrabecula 
Trabecula 
Jaws 
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ah tr 


(SS 
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Figure 5 | Evolution of the vertebrate head. A model based on the 
assumption that the pan-cyclostome pattern (in box) represents a 
plesiomorphic pattern for the entire vertebrates. The hagfish 
oronasohypophyseal septum (ons) and the lamprey upper lip (ulp) are both 
derivatives of the post-hypophyseal process (php). Separation of the NHP into 
the nasal epithelium (ne) and adenohypophysis (ah) may have led to the pattern 
of galeaspids”. Further bilateral separation of the nasal epithelia is regarded asa 
prerequisite for acquisition of the trabecula (tr) and upper and lower jaws (uj 
and lj) in crown gnathostomes. ph, pharynx. See Fig. 1, 2 and 4 for other 
abbreviations. 
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After the establishment of the cyclostome developmental pattern, 
hagfish embryos go through enormous morphological changes, includ- 
ing the caudal shift of pharyngeal structures (Fig. 5 and Supplementary 
Fig. 1la—j) and opening of the nasohypophyseal duct into the pharynx 
(Figs 2m-p and 5). For these hagfish-specific, late-embryonic modifi- 
cations, anatomical comparison of lampreys and hagfishes is difficult, 
and lampreys and gnathostomes used to be regarded as more closely 
related to each other*'!”*, The autoapomorphic nature of hagfish mor- 
phology, especially in the craniofacial region, is consistent with a fossil 
hagfish, Myxinikela siroka, with oropharyngeal structures positioned 
more rostrally than in the living hagfishes, which can be regarded as 
representing an intermediate transitional stage*’. Although the mor- 
phology of the nasohypophyseal duct in M. siroka remains unclear, the 
medial subnasal cartilage (T2) beneath the nasohypophyseal duct, at 
least, is located more dorsally than the lateral labial cartilage (T1)””. 
Thus, hagfishes are likely to be a highly derived lineage of cyclostomes. 

Without suitable outgroups, the evaluation of the proposed pan- 
cyclostome pattern is not easy: if this was the pattern of the common 
ancestor of cyclostomes, could gnathostomes have evolved by the 
modification of this same pattern (Fig. 5)? Recent studies have sug- 
gested that, even before acquisition of the jaw, major components of 
the vertebrate head were re-organized in gnathostome lineages”. In 
the lineage of gnathostome, evolutionary polarity from single to dou- 
ble nostrils is apparent. If the monorhiny in stem gnathostomes 
shared the same origin as that of cyclostomes, that would be highly 
indicative of NHP in their embryos. In this context, two informa- 
tive groups of fossil agnathans are galeaspids and osteostracans, the 
endoskeletons of which are well ossified. In particular, the cranial 
morphology of osteostracans has traditionally been compared with 
the lamprey with its single nasohypophyseal duct that opens dorsally*". 
However, a recent observation of galeaspids, a lineage basal to the 
osteostracans™, has suggested that their adenohypophysis was sepa- 
rated from the nasal sac and opened into the oral cavity, as in jawed 
vertebrates”. If this animal truly represents a transition from mono- to 
diplorhiny states, it would represent an intermediate condition towards 
the crown gnathostomes®*” (Fig. 5). Thus, the lamprey-like morphology 
of osteostracans would represent a secondary condition. Even so, the 
embryonic development of galeaspids may have initially passed through 
the cyclostome pattern, yet with a widely expanded NHP that preceded 
its break-up into nasal and hypophysial placodes, as seen in jawed 
vertebrates. Thus, the pan-cyclostome pattern may possibly represent 
a symplesiomorphic ancestral state of the craniofacial development of 
the vertebrates. If so, the major part of the gnathostome craniofacial 
evolution could possibly be seen as the loss and modification of the 
cyclostome-like ancestral patterns, especially characterized by the 
acquisition of diplorhiny, upon which the evolution of the prechordal 
cranium and jaw depended*” (Fig. 5). 


METHODS SUMMARY 


Eptatretus burgeri embryos were collected as described previously’, staged 
according to ref. 44 (Supplementary Table 1), and fixed with Serra’s fixatives or 
4% paraformaldehyde. Sections were made by Kawamoto’s protocol (Section Lab 
Co. Ltd). Haematoxylin and eosin staining followed standard protocols. Images 
were recorded with a DP70 digital camera (Olympus Corporation) attached to a 
light microscope and reconstructed with Avizo (Visualization Sciences Group). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Sample collection and aquarium maintenance of hagfish. Eptatretus burgeri 
embryos were collected as described previously”’, staged according to ref. 44 (Sup- 
plementary Table 1), and fixed with Serra’s fixatives or 4% paraformaldehyde. 
cDNA cloning and sequencing. EbPitxA, EbSix3/6A, EbFgf8/17, EbHh1, 
EbTbx1/10A, EbSoxB1, EbLhx3/4A and EbNkx2.1A fragments were amplified 
by degenerate PCR with reverse transcription and isolated by the TOPO TA 
cloning kit dual promoter (Invitrogen). The sequence data were submitted 
to the DDBJ database (AB703678-AB703682, AB729075-AB729076, and 
AB747372). To confirm the orthology of newly identified hagfish genes, homo- 
logous sequences were retrieved from the NCBI Protein database and Ensembl 
Genome Browser, and multiple sequence alignments were generated using the mul- 
tiple alignment program MAFFT™, followed by visual inspection and manual cura- 
tion. On the basis of these alignments, molecular phylogenetic trees were inferred 
with the neighbour-joining and maximum-likelihood methods (data not shown). 
Histology of hagfish embryos. To avoid distortion of histological sections and 
particularly to preserve the oropharyngeal membrane, Kawamoto’s Film Method 
was employed using the Paraffin Section Preparation kit (Section Lab Co. Ltd; see 
http://section-lab.jp/English.htm). The adhesive side of the Paraffin Transfer 
Film (fine type) was attached firmly onto the cut surface of the paraffin block, 
and each section was cut slowly at 6-8 jum thick with a microtome. One side of the 
cut ‘film-sections’ was firmly placed onto a glass slide with the sectioned side 
down, and the slide was bathed in water at 45°C for 10s. The films were then 
pressed and attached on the glass slide with a rubber roller, and dried on the hot 
plate at 50 °C for 24h to transfer the embryonic tissues onto the slides. The 
adhesive compound and paraffin were removed with xylene for 1-3 days before 
histological, immunohistochemical and in situ hybridization procedures. Images 
were recorded with a DP70 digital camera (Olympus Corporation) attached to a 
light microscope and reconstructed with Avizo (Visualization Sciences Group). 
In situ hybridization of hagfish. In situ hybridization was performed either by 
using a manual standard protocol or a Ventana automated instrument (Roche 
Diagnostics). In the standard protocol, serial sections were fixed for 10 min in 4% 
paraformaldehyde (PFA) in phosphate-buffered saline (PBS) at room tempera- 
ture, washed twice in PBS, treated with ProK in 0.01 M Tris buffer for 10 min, and 
fixed again for 10 min in 4% PFA at room temperature. After rinsing twice in PBS, 
the sections were incubated with 0.25% acetic anhydride and 0.1 M triethanola- 
mine (pH 8), washed in PBS, air dried, and hybridized at 51°C for 16-20 h. The 
sections were then washed in 5X SSC at 55 °C, treated with 50% formamide in 
2X SSC at 60°C for 20 min, washed in 2X SSC and twice in 0.2 SSC at 60 °C 
each for 20 min. After blocking with 1.5% blocking reagent (Roche Diagnostics) 
in 0.1 M Tris buffer with 0.15 M NaCl (pH 7.6), the sections were incubated with 
alkaline phosphatase-conjugated anti-DIG antibody (Roche Diagnostics). After 
the final washes with Tris buffer, positive cells were stained purple with nitroblue 
tetrazolium salt (NBT) and 5-bromo-4-chloro-3-indolyl phosphate toludinium 
salt (BCIP). When using the Ventana instrument, detection of signals and coun- 
terstaining were performed by using the BlueMap NBT/BCIP substrate kit and a 
nuclear fast red-equivalent reagent, ISH RED, was applied (Roche Diagnostics), 
as described previously*’. 

Sample collection and aquarium maintenance of lampreys. Mature male and 
female lampreys, Lethenteron japonicum, were collected from the Miomote River 
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in Niigata and the Shiribetsu River in Hokkaido, Japan, during the breeding 
season (early June). Eggs were fertilized artificially and incubated in 10% 
Steinberg’s solution at 16-23 °C (ref. 46). Lamprey embryos were staged accord- 
ing to Tahara’s staging of Lethenteron reissneri”’, a brook lamprey species closely 
related to L. japonicum’, and fixed in 4% PFA/PBS overnight at 4°C. The 
embryos were gradually dehydrated in a graded series of methanol/PBS. 
Finally, they were stored in 100% methanol at —20 °C. 

Histology of lamprey embryos. Fixed lamprey embryos stored in methanol 
(—20°C) were placed in dimethylsulphoxide (DMSO) and methanol (1:1). 
After washing with Tris-HCl-buffered saline (TST) containing 5% DMSO 
(TSTd), the embryos were blocked with aqueous 1% periodic acid and 5% non- 
fat dry milk in TSTd (TSTM). The embryos were incubated in the primary 
antibody (anti-acetylated tubulin (Sigma) diluted 1:1,000 in TSTM) for 2-4 days 
at room temperature. After washing with TST, the samples were incubated with 
Alexa Fluor 488 goat anti-mouse IgG (H+L) secondary antibody (diluted 1:1000 
in TSTM; Molecular Probes). After the final wash in TSTd, the embryos were 
dehydrated and clarified with a 1:2 mixture of benzyl alcohol and benzyl benzo- 
ate, in which cell and nucleic membrane were counterstained with membrane- 
bound fluorescent dye (Cell Tracker CM-Dil; Invitrogen) to be observed with a 
confocal microscope (LSM710; Zeiss, Inc.). 

Scanning electron microscopy of lamprey embryos. Embryos were fixed with 
2.5% glutaraldehyde in 0.01 M PBS, washed in 0.01 M phosphate buffer, and post- 
fixed with 4% osmium tetraoxide. The embryos were then rinsed in phosphate 
buffer, dehydrated with a graded series of ethanol, and freeze-dried with t-butyl 
alcohol. The embryos were placed on an aluminium stub, sputter-coated with 
gold—palladium alloy, and viewed with a scanning electron microscope (JSM- 
5800, JEOL), as described previously”. 

In situ hybridization of lamprey. Whole-mount in situ hybridization of lam- 
prey, L. japonicum, was performed as described previously”, with minor modi- 
fications. Hybridization and post-hybridization washes were performed at 65 °C 
to avoid nonspecific cross-hybridization among paralogues. 

Sample collection and aquarium maintenance of shark. Fertilized eggs of 
Scyliorhinus torazame were obtained from adult animals that were bred at 
16 °C in seawater tanks. Shark embryos were staged according to Ballard’s staging 
of Scyliorhinus canicula, a species closely related to S. torazame'””. 
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GRADUATE STUDENTS 


Structured study 


European institutions are introducing organized doctoral 
programmes that broaden students’ education. 


BY AMBER DANCE 


athy Kennedy faced a decision six 

k months into her PhD. Having spent 

time in various labs studying para- 

sitic liver flukes and the intestinal bacterium 

Clostridium difficile, she had to choose where 

to spend the next three-and-a-half years while 

completing her thesis. The rotations helped her 

not only to make an educated decision about 

which research group to join, but also to meet 
colleagues and learn extra skills. 

Five or ten years ago, her experience would 
have been unusual in Europe, where doctoral 
students would find a mentor, then imme- 
diately join his or her research group. They 
rarely had classes or a choice of research top- 
ics. “I went into the lab until I was done; recalls 
Christine Loscher, who directs the Bioanalysis 
and Therapeutics Structured PhD Programme 
(BioAT) at Dublin City University, and is 
Kennedy’s chosen adviser. 

A doctorate in Europe traditionally involved 
a one-on-one relationship between master and 
apprentice. In Germany, for example, the word 
for that influential mentor is Doktorvater — 
‘doctor father. Pursuing related interests and 
coursework was not generally on the agenda. 
PhD holders usually went on to make their 
careers in academia, so the Doktorvater was 
training his or her replacement. 

Kennedy’s experience reflects a new kind of 
doctorate. A structured PhD generally includes 
the research training of a conventional degree, 
together with other elements that keep stu- 
dents on track and help to prepare them for 
a variety of post-PhD careers. Students have 
multiple advisers and administrators to guide 
them, receive formal training in scientific and 
soft skills and have opportunities for travel 
and interdisciplinary study. In principle, they 
receive a broader education than any single 
supervisor could provide. 


NEW-LOOK DOCTORATES 
The rise of structured PhDs has been rapid. 
In 2007, just over one-quarter of European 
institutions had established structured doc- 
toral programmes; by 2010, it was almost two- 
thirds, according to the European University 
Association (EUA) in Brussels’. And whereas 
in 2007 almost half the institutions offered for- 
malized classwork — usually taking the form 
of short modules that ran for a day or two — 
72% did so in 2010. 

In many ways, Europe’s move towards 
structured PhD programmes is making the 
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continent’s doctoral education more like that in 
the United States, where students apply to pro- 
grammes rather than to principal investigators, 
and undertake coursework and rotations before 
beginning their theses. According to a survey” 
of 160 institutions released in October 2012, 
more than 90% of North American universi- 
ties offer structured 
PhDs. But differences 
remain: US PhDs tend 
to take longer — most 
students graduate 
in seven years, com- 
pared with four in 
the European Union 
(EU) — because they 
include the training 
that European stu- 
dents typically get in 


a separate master’s “In the 

degree. EU pro- traditional 
grammes also tend to mode L it was 

put moreemphasison JUST assumed 
broad skill sets likely thatifyou’rea 

to be usefulinnon- goodresearcher, 
academic positions. youw’re a good 


In Europe, the 
implementation of 


supervisor.” 
Thomas Jorgensen 


‘structure’ varies 

widely. In some places, such as France and 
the United Kingdom, structured programmes 
are already the norm. Others cling to the con- 
ventional model or have not yet implemented 
structured PhDs. Even within a nation, some 
institutions or departments may offer struc- 
tured programmes whereas others do not. 


GATHERING SKILLS 

Now in the second year of her PhD, Kennedy 
has taken classes in presentation skills, statis- 
tics and scientific writing, as well as modules 
more specific to her research topic: the immune 
system’s recognition of Clostridium surface pro- 
teins. BioAT is a joint effort between six Irish 
institutions, and Kennedy will interact with 
colleagues in other disciplines, institutions and 
countries. She will meet formally with Loscher 
and a second adviser twice a year to make sure 
that her work is on track. She will also have the 
chance to study intellectual property and com- 
mercialization — which will come in handy if 
she, like many PhD students, ends up pursuing 
a career in industry. “You become a more well- 
rounded student because all these different 
areas are covered,’ she says. 

Candidates who have trained in structured 
programmes, and taken classes on topics such 
as patent searches and intellectual property, 
can hit the ground running in an industry job, 
says Declan Moran, a manager at Ipsen, a phar- 
maceutical company in Dublin. 

And although such extra course require- 
ments may mean a greater commitment to the 
classroom, one’s research project need not be 
neglected, say advocates. Ideally, the classes 
and programme together provide a framework 


for students’ main studies. “This does not 
diminish the PhD itself as an original piece 
of research,” says Conor O’Carroll, research 
director at the Irish Universities Association 
in Dublin. “It’s to enhance the traditional PhD” 

There is some early evidence that it is work- 
ing. O'Carroll conducted surveys of a total of 
1,455 PhD trainees across seven Irish universi- 
ties in 2008, 2009 and 2010 (ref. 3). Even in that 
short period, students’ experiences changed as 
more of them entered structured programmes. 
For example, the number of students who had 
received detailed timetables at the start of their 
PhDs rose by 20% between 2008 and 2010. Stu- 
dents also became 25% more likely to discuss 
their research projects in detail with their advis- 
ers at the start of their programmes, and there 
was a 20% increase in student satisfaction with 
the availability of their mentors. The number 
of students with more than one supervisor rose 
by 18%. All these factors contribute to a happier 
student, says O’Carroll. 

Structured programmes also boosted 
candidates’ scientific output, he found: their 
students were 6% more likely to publish a 
paper than were those in conventional train- 
ing. They were also 5% more likely to present 
their work at an international meeting. 


UNEXPECTED BENEFIT 
Students come out of structured programmes 
with a broad set of skills — some of which they 
may not have known they needed. When she 
began her PhD in biochemistry at the Free 
University of Berlin, Agnieszka Denkis was 
fairly sure that she wanted an academic career. 
But three years later, she is considering enter- 
ing industry to do 
research with real- 
world applications. 
She doesn't have to 
look far for informa- 
tion on how to make 
the transition — her 
programme offers 
workshops on indus- 
trial careers. 

Denkis is also 
developing transfer- 


able skills outside “You have to 
official modules. She like what you’re 
has helped to organize doing . You’re 
asymposium, getting going to be 


married to it.” 
Sarah-Jo Sinnott 


practice in fund-rais- 
ing and inviting pre- 
senters. And through 
teaching, she has worked on her public speak- 
ing. Trainees in conventional PhDs can also 
seek out these opportunities, but structured 
programmes organize them and make them 
easier for students to find. 

Students in structured PhD programmes are 
generally not dependent on a single supervi- 
sor: Sarah-Jo Sinnott, who is in her third year 
of the Health Services Research programme 
at University College Cork in Ireland, has 
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four. “I seem to collect them,” she jokes. When 
she started her studies, she was assigned one 
mentor, who studies dental public health. As 
she developed her thesis topic, a study of new 
insurance co-payments for prescriptions in 
Ireland, she recruited a lecturer in pharmacy 
studies and two economists. Her programme 
is a multi-institutional effort, so she can receive 
guidance from faculty members at University 
College Cork, Trinity College Dublin and the 
Royal College of Surgeons in Ireland, also in 
Dublin. 

Having a diverse committee means that 
trainees have someone to go to if they fall out 
with their primary advisers. They can also get 
multiple references after they graduate. 

In some programmes, the supervisors, too, 
receive structured training. “In the traditional 
model, it was just assumed that if you're a good 
researcher, you're a good supervisor,’ says 
Thomas Jorgensen, senior programme man- 
ager for the Council for Doctoral Education at 
the EUA. These days, by contrast, Irish faculty 
members can attend workshops in mentoring. 
Swedish advisers are often required to pass a 
test on rules such as how much time they must 
set aside for advising. Other new supervisors 
may simply share a coffee with a more seasoned 
mentor to get some advice, says Jorgensen. 

Through classes, rotations and social events 
that bring together students working on dif- 
ferent research topics, structured programmes 
facilitate networking that might not other- 
wise be available to students. That enhanced 
interaction pays dividends, says Eric Haertel, 
a second-year PhD student in the Molecular 
Life Sciences programme at the Swiss Institute 
of Technology (ETH) in Zurich. Trainees from 
ETH and the University of Zurich, which is a 
partner in the programme, get together for bar- 
becues and to drink tea, and attend an annual 
retreat with student presentations and a party. 
Without the programme, Haertel wouldn't 
know anybody at the university, he says. The 
most valuable element of those social events 
is chatting with other students about his and 
their work, and solving problems together. And 
simply knowing who else works on the same 
topic can be useful, says Haertel, when he runs 
out of reagents and needs to borrow something. 

The networking of PhD candidates helps 
their advisers, too, because students can bring 
back valuable information from other labs, 
or can start collaborations, says Loscher. For 
example, during one of her rotations, Kennedy 
performed luciferase assays, which measure 
the expression of genes linked to the firefly 
enzymes that produce light; now she can teach 
the technique to the rest of Loscher’s lab group. 


ACROSS BORDERS AND SECTORS 

Many structured programmes promote net- 
working across national and disciplinary bor- 
ders. Ilaria Alborelli entered a programme at 
the University of Basel in Switzerland with an 
eye towards a project in fruitfly development. 


But she broadened her experience through 
rotations in neurobiology and stem-cell 
labs. The programme also includes funding 
for international travel. “It’s good to expe- 
rience research in different environments, 
says Alborelli. Making it easy for students to 
travel also helps them to reach the best labs 
for their research, adds O’Carroll. 

The process of applying for a structured 
PhD varies with the programme. In the 
conventional model, trainees found a pro- 
fessor to take them on and joined that lab. 
In some universities, this still happens, but 
the principal investigator then encour- 
ages the student to join an appropriate 
programme. 

In other cases, applicants approach the 
programme first, and faculty members 
jointly decide whether their qualifications 
merit an interview. During the interview 
visit, applicants meet several faculty mem- 
bers and look for a good match; ifa profes- 
sor and student want to work together, then 
the student is admitted. And in some pro- 
grammes, students do not even decide ona 
mentor until they have started their degrees 
and completed their rotations. This ‘group 
application’ process helps faculty mem- 
bers too, says O’Carroll. Instead of picking 
through individual students, they can let 
the programme select the best candidates. 

Structured studies may not be the best fit 
for everyone, says Sarah Weigelt, a postdoc 
at the University of Miinster in Germany, 
who will start a job as a junior professor 
at the University of Bochum in April. 
“It’s quite dependent on the personality,” 
she says. Some trainees thrive in a clearly 
defined curriculum, but others know 
exactly what research they want to do, and 
Weigelt wouldn't want them to be delayed 
by structured coursework. She would pre- 
fer to see universities offer a range of PhD 
options so that students can choose their 
ideal path. 

Although picking the right sort of pro- 
gramme matters, the key factor in success 
may still be finding the right research pro- 
ject. “You have to like what you're doing,” 
says Sinnott. “You're going to be married 
to it” m 


Amber Dance is a freelance science writer 
in Los Angeles, California. 
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TURNING POINT 
Rui Costa 


Rui Costa, a neuroscientist at the 
Champalimaud Center for the Unknown 

in Lisbon, accepted his latest accolade last 
October: a US$15,000 Young Investigator 
Award from the Society of Neuroscience in 
Washington DC, for his innovative approach 
in detailing the brain circuits involved in 
learning. Trained as a veterinary surgeon, 
Costa traces his success to a 1997 New Year's 
resolution to follow his research muse. 


Has animal behaviour always interested you? 
Yes; as a kid, I loved watching natural-history 
television programmes. In my last year of vet- 
erinary school, I did a research internship at 
the Swedish University of Agricultural Sci- 
ences in Skara, in which I studied suckling 
behaviour in cows and dolphins. I wanted to 
understand the origins of behaviour and real- 
ized that I would have to study the brain. 


What do you consider the most pivotal 
moment in your career? 

I worked as a dairy-cow vet for one year after 
getting my doctorate of veterinary medicine at 
the Technical University of Lisbon. But I knew 
that I had to follow my passion for research. 
On New Year’s Eve of 1997, I quit. I had no 
job and no salary, but I wanted so badly to be 
a researcher that there was no question that I 
would find a way to make it work. It sounds 
crazy, but putting yourself in a position of not 
going back frees you to pursue your passion. 


How did you end up in the United States at the 
University of California, Los Angeles (UCLA)? 

I found a government-sponsored programme 
at the University of Porto in Portugal, in which 
students took a year of classes and could then do 
their research at any university that would fund 
them. I decided to contact memory researcher 
Alcino Silva, who was moving to UCLA. 


Why did you choose Silva? 

He had done the first genetic manipulation of 
the mouse brain. I believed that looking at brain 
circuits was the best way to study genetics and 
behaviour, so I e-mailed him. It wasn't my tech- 
nical skills that convinced him to take me on, so 
it must have been my passion for the research. 


Were the experiments that you wanted to 
conduct technologically feasible? 

Not really. I have always wanted to image brain 
activity while genetically manipulated animals 
were learning and behaving, but the technol- 
ogy didn’t exist then. Even today, we can still 
image only the surface of the brain. But I knew 
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that Miguel Nicolelis, a neurobiologist at Duke 
University in Durham, North Carolina, was 
recording many primate neurons at once using 
hundreds of electrodes. Looking for a postdoc 
position, I wrote to him and said that I would 
love to modify this technique to record activ- 
ity in mouse brains. He immediately replied 
that he had had the same idea, and called a 
few minutes later. He used a phrase in Brazil- 
ian Portuguese that means, “Let's go ahead, 
because people are coming from behind”. 
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Did you always intend to return to Portugal? 

I married an American and I liked being in the 
United States. Yet I missed my family in Portu- 
gal and I felt that because Portugal had given 
me this opportunity, I would one day return. 
However, I didn’t want to just go back and 
not do science. In 2004, Anténio de Sommer 
Champalimaud, a Portuguese entrepreneur, 
died and willed roughly one-third of his fortune 
to launching the Champalimaud Foundation. 
The foundation decided to start a neuroscience 
programme at the Champalimaud Center for 
the Unknown, with two other researchers and 
me. In 2007, we shook hands to seal the deal. 


How has the experience been so far? 
Everything has gone smoothly. The building 
has an excellent view of the sea and a huge 
open lab space. We were able to recruit great 
talent from around the world and have grown 
to 15 labs, with 130 people. When I arrived, I 
got a big grant from the European Research 
Council; last year, I received a 5-year Interna- 
tional Early Career Scientist award from the 
Howard Hughes Medical Institute in Chevy 
Chase, Maryland. The centre has had a lot of 
press and publicity, so there is a lot of hype 
about the place — which helped with recruit- 
ment. Now we need to start publishing results 
to show the public what we are doing. m 


INTERVIEW BY VIRGINIA GEWIN 
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